technology, in particular the widespread adoption of high throughput approaches based 
on pyrosequencing. Inclusion of these new techniques in the Sixth Edition has prompted 
me to completely rewrite the material on DNA sequencing and to place all the relevant 
information — both on the methodology itself and its application to genome sequencing 
— into a single chapter. This has enabled me to devote another entire chapter to the 
post-sequencing methods used to study genomes. The result is, I hope, a more balanced 
treatment of the various aspects of genomics and post-genomics than I had managed in 
previous editions. 

A second important development of the last few years has been the introduction of 
real-time PCR as a means of quantifying the amount of a particular DNA sequence pre- 
sent in a preparation. This technique is now described as part of Chapter 9. Elsewhere, 
I have made various additions, such as inclusion of topoisomerase-based methods for 
blunt end ligation in Chapter 4, and generally tidied up parts of chapters that had 
become slightly unwieldy due to the cumulative effects of modifications made over the 
25 years since the First Edition of this book. The Sixth Edition is almost twice as long 
as the First, but retains the philosophy of that original edition. It is still an introductory 
text that begins at the beginning and does not assume that the reader has any prior 
knowledge of the techniques used to study genes and genomes. 

I would like to thank Nigel Balmforth and Andy Slade at Wiley-Blackwell for help- 
ing me to make this new edition a reality. As always I must also thank my wife Keri for 
the unending support that she has given to me in my decision to use up evenings and 
weekends writing this and other books. 

Important 



Chapter contents 

The early development of genetics 

The advent of gene cloning and the polymerase chain reaction 
What is gene cloning? 

What is PCR? 

Why gene cloning and PCR are so important 
How to find your way through this book 



In the middle of the 19th century, Gregor Mendel formulated a set of rules to explain 
the inheritance of biological characteristics. The basic assumption of these rules is that 
each heritable property of an organism is controlled by a factor, called a gene, that is a 
physical particle present somewhere in the cell. The rediscovery of Mendel’s laws in 
1900 marks the birth of genetics, the science aimed at understanding what these genes 
are and exactly how they work. 

1.1 The early development of genetics 

For the first 30 years of its life this new science grew at an astonishing rate. The idea 
that genes reside on chromosomes was proposed by W. Sutton in 1903, and received 
experimental backing from T.H. Morgan in 1910. Morgan and his colleagues then 
developed the techniques for gene mapping, and by 1922 had produced a comprehen- 
sive analysis of the relative positions of over 2000 genes on the 4 chromosomes of the 
fruit fly, Drosophila melanogaster. 

Despite the brilliance of these classical genetic studies, there was no real under- 
standing of the molecular nature of the gene until the 1940s. Indeed, it was not until 
the experiments of Avery, MacLeod, and McCarty in 1944, and of Hershey and Chase 
in 1952, that anyone believed that deoxyribonucleic acid (DNA) is the genetic material: 
up until then it was widely thought that genes were made of protein. The discovery 
of the role of DNA was a tremendous stimulus to genetic research, and many famous 
biologists (Delbrück, Chargaff, Crick, and Monod were among the most influential) 
contributed to the second great age of genetics. In the 14 years between 1952 and 1966, 
the structure of DNA was elucidated, the genetic code cracked, and the processes of 
transcription and translation described. 

1.2 The advent of gene cloning and the polymerase 
chain reaction 

These years of activity and discovery were followed by a lull, a period of anticlimax 
when it seemed to some molecular biologists (as the new generation of geneticists styled 
themselves) that there was little of fundamental importance that was not understood. 
In truth there was a frustration that the experimental techniques of the late 1960s were 
not sophisticated enough to allow the gene to be studied in any greater detail. 

Then in the years 1971-1973 genetic research was thrown back into gear by what at 
the time was described as a revolution in experimental biology. A whole new method- 
ology was developed, enabling previously impossible experiments to be planned and 
carried out, if not with ease, then at least with success. These methods, referred to as 
recombinant DNA technology or genetic engineering, and having at their core the pro- 
cess of gene cloning, sparked another great age of genetics. They led to rapid and 
efficient DNA sequencing techniques that enabled the structures of individual genes to 
be determined, reaching a culmination at the turn of the century with the massive 
genome sequencing projects, including the human project which was completed in 2000. 
They led to procedures for studying the regulation of individual genes, which have 
allowed molecular biologists to understand how aberrations in gene activity can result 
in human diseases such as cancer. The techniques spawned modern biotechnology, 
which puts genes to work in production of proteins and other compounds needed in 
medicine and industrial processes. 

During the 1980s, when the excitement engendered by the gene cloning revolution 
was at its height, it hardly seemed possible that another, equally novel and equally 
revolutionary process was just around the corner. According to DNA folklore, Kary 
Mullis invented the polymerase chain reaction (PCR) during a drive along the coast 
of California one evening in 1985. His brainwave was an exquisitely simple technique 
that acts as a perfect complement to gene cloning. PCR has made easier many of the 
techniques that were possible but difficult to carry out when gene cloning was used on 
its own. It has extended the range of DNA analysis and enabled molecular biology to find 
new applications in areas of endeavor outside of its traditional range of medicine, agri- 
culture, and biotechnology. Archaeogenetics, molecular ecology, and DNA forensics 
are just three of the new disciplines that have become possible as a direct consequence 
of the invention of PCR, enabling molecular biologists to ask questions about human 
evolution and the impact of environmental change on the biosphere, and to bring their 
powerful tools to bear in the fight against crime. Forty years have passed since the dawn- 
ing of the age of gene cloning, but we are still riding the rollercoaster and there is no 
end to the excitement in sight. 

1.3 What is gene cloning? 

What exactly is gene cloning? The easiest way to answer this question is to follow 
through the steps in a gene cloning experiment (Figure 1.1): 

1 A fragment of DNA, containing the gene to be cloned, is inserted into a circular 
DNA molecule called a vector, to produce a recombinant DNA molecule. 

2 The vector transports the gene into a host cell, which is usually a bacterium, 
although other types of living cell can be used. 

3 Within the host cell the vector multiplies, producing numerous identical copies, 
not only of itself but also of the gene that it carries. 

4 When the host cell divides, copies of the recombinant DNA molecule are passed to 
the progeny and further vector replication takes place. 

5 After a large number of cell divisions, a colony, or clone, of identical host cells is 
produced. Each cell in the clone contains one or more copies of the recombinant 
DNA molecule; the gene carried by the recombinant molecule is now said to be 
cloned. 
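
The five steps above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the circular vector is written as a linear string whose last base joins its first, the sequences and the cut position are hypothetical, and every cell is assumed to hold the same fixed number of copies. The function names `insert_fragment` and `copies_in_clone` are invented for this sketch.

```python
def insert_fragment(vector: str, fragment: str, cut_site: int) -> str:
    """Open the vector at cut_site and join the fragment into the gap.

    The circular vector is written as a linear string, so the result is
    the recombinant molecule written from the same starting point.
    """
    if not 0 <= cut_site <= len(vector):
        raise ValueError("cut_site must lie within the vector")
    return vector[:cut_site] + fragment + vector[cut_site:]


def copies_in_clone(divisions: int, copies_per_cell: int) -> int:
    """Recombinant molecules in a clone grown from one cell.

    Assumes every division doubles the cell count and that each cell
    carries the same number of copies (a simplification).
    """
    cells = 2 ** divisions
    return cells * copies_per_cell


vector = "AAACCCGGGTTT"   # hypothetical vector sequence
gene = "ATGGCATAA"        # hypothetical fragment carrying the gene
recombinant = insert_fragment(vector, gene, cut_site=6)

print(recombinant)
print(copies_in_clone(divisions=20, copies_per_cell=50))
```

The second function shows why a clone is such a rich source of the gene: the number of copies grows with every round of cell division as well as with the number of copies held in each cell.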

1.4 What is PCR? 

The polymerase chain reaction is very different from gene cloning. Rather than a series 
of manipulations involving living cells, PCR is carried out in a single test tube simply by 
mixing DNA with a set of reagents and placing the tube in a thermal cycler, a piece of 
equipment that enables the mixture to be incubated at a series of temperatures that are 
varied in a preprogrammed manner. The basic steps in a PCR experiment are as follows 
(Figure 1.2): 
1 The mixture is heated to 94°C, at which temperature the hydrogen bonds that 
hold together the two strands of the double-stranded DNA molecule are broken, 
causing the molecule to denature. 

2 The mixture is cooled down to 50-60°C. The two strands of each molecule could 
join back together at this temperature, but most do not because the mixture 
contains a large excess of short DNA molecules, called oligonucleotides or primers, 
which anneal to the DNA molecules at specific positions. 

3 The temperature is raised to 74°C. This is a good working temperature for the 
Taq DNA polymerase that is present in the mixture. We will learn more about 
DNA polymerases on p. 48. All we need to understand at this stage is that the 
Taq DNA polymerase attaches to one end of each primer and synthesizes new 
strands of DNA, complementary to the template DNA molecules, during this step 
of the PCR. Now we have four strands of DNA instead of the two that there were 
to start with. 

4 The temperature is increased back to 94°C. The double-stranded DNA 
molecules, each of which consists of one strand of the original molecule and 
one new strand of DNA, denature into single strands. This begins a second cycle 
of denaturation-annealing-synthesis, at the end of which there are eight DNA 
strands. By repeating the cycle 30 times the double-stranded molecule that we 
began with is converted into over 130 million new double-stranded molecules, 
each one a copy of the region of the starting molecule delineated by the annealing 
sites of the two primers. 
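
The arithmetic behind the cycle can be made concrete with a short Python sketch. It assumes perfect doubling, meaning that every strand present is copied exactly once in every cycle, and it takes 55°C as an assumed midpoint of the 50-60°C annealing range. Perfect doubling is an idealization: real reactions fall short of it, and the products of the earliest cycles do not all have both ends defined by the primers, so the value printed for 30 cycles should be read as an upper bound and not as the figure quoted in the text. The names used are invented for this illustration.

```python
# One cycle as described in the text: (step, temperature in degrees C).
CYCLE = [
    ("denaturation", 94),
    ("annealing", 55),    # assumed midpoint of the 50-60 degree range
    ("synthesis", 74),
]


def molecules_after(cycles: int, starting_molecules: int = 1) -> int:
    """Double-stranded molecules after the given number of cycles,
    assuming every molecule is copied once per cycle."""
    return starting_molecules * 2 ** cycles


def strands_after(cycles: int, starting_molecules: int = 1) -> int:
    """Single strands after the given number of cycles (two per molecule)."""
    return 2 * molecules_after(cycles, starting_molecules)


for n in (1, 2, 30):
    print(n, strands_after(n), molecules_after(n))
```

The first two rows reproduce the counts given in steps 3 and 4: four strands after one cycle and eight after two.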

1.5 Why gene cloning and PCR are so important 

As you can see from Figures 1.1 and 1.2, gene cloning and PCR are relatively straight- 
forward procedures. Why, then, have they assumed such importance in biology? The 
answer is largely because both techniques can provide a pure sample of an individual 
gene, separated from all the other genes in the cell. 

1.5.1 Obtaining a pure sample of a gene by cloning 

To understand exactly how cloning can provide a pure sample of a gene, consider the 
basic experiment from Figure 1.1, but drawn in a slightly different way (Figure 1.3). 
In this example the DNA fragment to be cloned is one member of a mixture of many 
different fragments, each carrying a different gene or part of a gene. This mixture could 
indeed be the entire genetic complement of an organism — a human, for instance. Each 
of these fragments becomes inserted into a different vector molecule to produce a 
family of recombinant DNA molecules, one of which carries the gene of interest. Usu- 
ally only one recombinant DNA molecule is transported into any single host cell, so 
that although the final set of clones may contain many different recombinant DNA 
molecules, each individual clone contains multiple copies of just one molecule. The gene 
is now separated away from all the other genes in the original mixture, and its specific 
features can be studied in detail. 

In practice, the key to the success or failure of a gene cloning experiment is the abil- 
ity to identify the particular clone of interest from the many different ones that are 
obtained. If we consider the genome of the bacterium Escherichia coli, which contains 
just over 4000 different genes, we might at first despair of being able to find just one 
gene among all the possible clones (Figure 1.4). The problem becomes even more over- 
whelming when we remember that bacteria are relatively simple organisms and that the 
human genome contains about five times as many genes. However, as explained in 
Chapter 8, a variety of different strategies can be used to ensure that the correct gene 
can be obtained at the end of the cloning experiment. Some of these strategies involve 
modifications to the basic cloning procedure, so that only cells containing the desired 
recombinant DNA molecule can divide and the clone of interest is automatically 
selected. Other methods involve techniques that enable the desired clone to be identified 
from a mixture of lots of different clones. 

Once a gene has been cloned there is almost no limit to the information that can 
be obtained about its structure and expression. The availability of cloned material has 
stimulated the development of analytical methods for studying genes, with new tech- 
niques being introduced all the time. Methods for studying the structure and expression 
of a cloned gene are described in Chapters 10 and 11, respectively. 

Figure 1.4 

The problem of selection. How can we select or identify just one gene? 

1.5.2 PCR can also be used to purify a gene 

The polymerase chain reaction can also be used to obtain a pure sample of a gene. This 
is because the region of the starting DNA molecule that is copied during PCR is the 
segment whose boundaries are marked by the annealing positions of the two oligonu- 
cleotide primers. If the primers anneal either side of the gene of interest, many copies 
of that gene will be synthesized (Figure 1.5). The outcome is the same as with a gene 
cloning experiment, although the problem of selection does not arise because the desired 
gene is automatically “selected” as a result of the positions at which the primers anneal. 
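
A short Python sketch shows how the two primers pick out the region that is copied. It treats annealing as an exact string match, which is a simplification, and all of the sequences are hypothetical. The forward primer has the same sequence as the left-hand boundary of the strand shown, and the reverse primer is the reverse complement of the right-hand boundary. The function names are invented for this illustration.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def amplified_region(template: str, forward: str, reverse: str):
    """Segment of template bounded by the two primer annealing sites.

    Returns None if either primer has no exact match, in which case
    nothing would be amplified in this simplified model.
    """
    start = template.find(forward)
    if start == -1:
        return None
    right_site = reverse_complement(reverse)
    end = template.find(right_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(right_site)]


# Hypothetical template: flanking DNA, primer sites, and a gene between them.
left_site = "GACTTA"
gene = "ATGAAACCCGGGTGA"
right_site = "CGTACC"
template = "TTTT" + left_site + gene + right_site + "AAAA"

forward = left_site                       # matches the strand shown
reverse = reverse_complement(right_site)  # anneals to the strand shown

print(amplified_region(template, forward, reverse))
```

The flanking bases outside the primer sites never appear in the result, which is the sense in which the gene is "selected" by the positions of the primers.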

A PCR experiment can be completed in a few hours, whereas it takes weeks if not 
months to obtain a gene by cloning. Why then is gene cloning still used? This is because 
PCR has two limitations: 

• In order for the primers to anneal to the correct positions, either side of the gene 
of interest, the sequences of these annealing sites must be known. It is easy to 
synthesize a primer with a predetermined sequence (see p. 139), but if the 
sequences of the annealing sites are unknown then the appropriate primers cannot 
be made. This means that PCR cannot be used to isolate genes that have not been 
studied before; that has to be done by cloning. 

• There is a limit to the length of DNA sequence that can be copied by PCR. 
Five kilobases (kb) can be copied fairly easily, and segments up to forty kb can be 
dealt with by using specialized techniques, but this is shorter than the lengths of 
many genes, especially those of humans and other vertebrates. Cloning must be 
used if an intact version of a long gene is required. 

Figure 1.5 

Gene isolation by PCR. 

Gene cloning is therefore the only way of isolating long genes or those that have 
never been studied before. But PCR still has many important applications. For example, 
even if the sequence of a gene is not known, it may still be possible to determine the 
appropriate sequences for a pair of primers, based on what is known about the sequence 
of the equivalent gene in a different organism. A gene that has been isolated and 
sequenced from, say, mouse could therefore be used to design a pair of primers for 
isolation of the equivalent gene from humans. 

In addition, there are many applications where it is necessary to isolate or detect 
genes whose sequences are already known. A PCR of human globin genes, for example, 
is used to test for the presence of mutations that might cause the blood disease called 
thalassaemia. Design of appropriate primers for this PCR is easy because the sequences 
of the human globin genes are known. After the PCR, the gene copies are sequenced or 
studied in some other way to determine if any of the thalassaemia mutations are present. 

Another clinical application of PCR involves the use of primers specific for the DNA 
of a disease-causing virus. A positive result indicates that a sample contains the virus and 
that the person who provided the sample should undergo treatment to prevent onset of 
the disease. The polymerase chain reaction is tremendously sensitive: a carefully set up 
reaction yields detectable amounts of DNA, even if there is just one DNA molecule in 
the starting mixture. This means that the technique can detect viruses at the earliest 
stages of an infection, increasing the chances of treatment being successful. This great 
sensitivity means that PCR can also be used with DNA from forensic material such as 
hairs and dried bloodstains or even from the bones of long-dead humans (Chapter 16). 

1.6 How to find your way through this book 

This book explains how gene cloning, PCR and other DNA analysis techniques 
are carried out and describes the applications of these techniques in modern biology. 
The applications are covered in the second and third parts of the book. Part II describes 
how genes and genomes are studied and Part III gives accounts of the broader applica- 
tions of gene cloning and PCR in biotechnology, medicine, agriculture, and forensic 
science. 

In Part I we deal with the basic principles. Most of the nine chapters are devoted to 
gene cloning because this technique is more complicated than PCR. When you have 
understood how cloning is carried out you will have understood many of the basic 
principles of how DNA is analyzed. In Chapter 2 we look at the central component of 
a gene cloning experiment — the vector — which transports the gene into the host cell and 
is responsible for its replication. To act as a cloning vector a DNA molecule must be 
capable of entering a host cell and, once inside, replicating to produce multiple copies 
of itself. Two naturally occurring types of DNA molecule satisfy these requirements: 

• Plasmids, which are small circles of DNA found in bacteria and some 
other organisms. Plasmids can replicate independently of the host cell 
chromosome. 

• Virus chromosomes, in particular the chromosomes of bacteriophages, which are 
viruses that specifically infect bacteria. During infection the bacteriophage DNA 
molecule is injected into the host cell where it undergoes replication. 

Chapter 3 describes how DNA is purified from living cells — both the DNA that 
will be cloned and the vector DNA — and Chapter 4 covers the various techniques for 
handling purified DNA molecules in the laboratory. There are many such techniques, 
but two are particularly important in gene cloning. These are the ability to cut the 
vector at a specific point and then to repair it in such a way that the gene is inserted 
(see Figure 1.1). These and other DNA manipulations were developed as an offshoot of 
basic research into DNA synthesis and modification in living cells, and most of the 
manipulations make use of purified enzymes. The properties of these enzymes, and the 
way they are used in DNA studies, are described in Chapter 4. 

Once a recombinant DNA molecule has been constructed, it must be introduced into 
the host cell so that replication can take place. Transport into the host cell makes use 
of natural processes for uptake of plasmid and viral DNA molecules. These processes 
and the ways they are utilized in gene cloning are described in Chapter 5, and the 
most important types of cloning vector are introduced, and their uses examined, in 
Chapters 6 and 7. To conclude the coverage of gene cloning, in Chapter 8 we investi- 
gate the problem of selection (see Figure 1.4), before returning in Chapter 9 to a more 
detailed description of PCR and its related techniques. 

Bacteriophages 

A DNA molecule needs to display several features to be able to act as a vector for 
gene cloning. Most importantly it must be able to replicate within the host cell, so that 
numerous copies of the recombinant DNA molecule can be produced and passed to the 
daughter cells. A cloning vector also needs to be relatively small, ideally less than 10 kb 
in size, as large molecules tend to break down during purification, and are also more 
difficult to manipulate. Two kinds of DNA molecule that satisfy these criteria can be 
found in bacterial cells: plasmids and bacteriophage chromosomes. 



2.1 Plasmids 

Plasmids are circular molecules of DNA that lead an independent existence in the 
bacterial cell (Figure 2.1). Plasmids almost always carry one or more genes, and often 
these genes are responsible for a useful characteristic displayed by the host bacterium. 
For example, the ability to survive in normally toxic concentrations of antibiotics 
such as chloramphenicol or ampicillin is often due to the presence in the bacterium of 
a plasmid carrying antibiotic resistance genes. In the laboratory, antibiotic resistance is 
often used as a selectable marker to ensure that bacteria in a culture contain a particu- 
lar plasmid (Figure 2.2). 

Gene Cloning and DNA Analysis: An Introduction. 6th edition. By T.A. Brown. Published 2010 by 
Blackwell Publishing. 

Part I The Basic Principles of Gene Cloning and DNA Analysis 

Figure 2.1 

Plasmids: independent genetic elements found in bacterial cells. 

Figure 2.2 

The use of antibiotic resistance as a selectable marker for a plasmid. RP4 (top) carries 
genes for resistance to ampicillin, tetracycline and kanamycin. Only those E. coli cells 
that contain RP4 (or a related plasmid) are able to survive and grow in a medium that 
contains toxic amounts of one or more of these antibiotics. 

Most plasmids possess at least one DNA sequence that can act as an origin of replication, 
so they are able to multiply within the cell independently of the main bacterial 
chromosome (Figure 2.3a). The smaller plasmids make use of the host cell’s own DNA 
replicative enzymes in order to make copies of themselves, whereas some of the larger 
ones carry genes that code for special enzymes that are specific for plasmid replication. 
A few types of plasmid are also able to replicate by inserting themselves into the bacterial 
chromosome (Figure 2.3b). These integrative plasmids or episomes may be stably 
maintained in this form through numerous cell divisions, but always at some stage exist 
as independent elements. 

Figure 2.3 

Replication strategies for (a) a non-integrative plasmid, and (b) an episome. 



Table 2.1 

Sizes of representative plasmids. 

PLASMID    SIZE: NUCLEOTIDE     SIZE: MOLECULAR    ORGANISM 
           LENGTH (kb)          MASS (MDa) 

pUC8         2.1                  1.8              E. coli 
ColE1        6.4                  4.2              E. coli 
RP4         54.0                 36.0              Pseudomonas and others 
F           95.0                 63.0              E. coli 
TOL        117.0                 78.0              Pseudomonas putida 
pTiAch5    213.0                142.0              Agrobacterium tumefaciens 


2.1.1 Size and copy number 

The size and copy number of a plasmid are particularly important as far as cloning is 
concerned. We have already mentioned the relevance of plasmid size and stated that less 
than 10 kb is desirable for a cloning vector. Plasmids range from about 1.0 kb for the 
smallest to over 250 kb for the largest plasmids (Table 2.1), so only a few are useful for 
cloning purposes. However, as we will see in Chapter 7, larger plasmids can be adapted 
for cloning under some circumstances. 
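
The size criterion can be applied directly to the plasmids listed in Table 2.1. The Python sketch below simply filters that table with the 10 kb guideline; the variable and function names are invented for this illustration, and the guideline is the rule of thumb stated above, not a hard limit.

```python
# (name, size in kb, organism), as listed in Table 2.1.
PLASMIDS = [
    ("pUC8", 2.1, "E. coli"),
    ("ColE1", 6.4, "E. coli"),
    ("RP4", 54.0, "Pseudomonas and others"),
    ("F", 95.0, "E. coli"),
    ("TOL", 117.0, "Pseudomonas putida"),
    ("pTiAch5", 213.0, "Agrobacterium tumefaciens"),
]

MAX_VECTOR_KB = 10.0  # guideline from the text: ideally less than 10 kb


def suitable_for_cloning(plasmids, limit_kb=MAX_VECTOR_KB):
    """Names of the plasmids that are smaller than the size guideline."""
    return [name for name, size_kb, _organism in plasmids if size_kb < limit_kb]


print(suitable_for_cloning(PLASMIDS))
```

Only the two smallest plasmids in the table pass the filter, which illustrates why only a few natural plasmids are useful for cloning without adaptation.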

The copy number refers to the number of molecules of an individual plasmid that are 
normally found in a single bacterial cell. The factors that control copy number are not 
well understood. Some plasmids, especially the larger ones, are stringent and have a 
low copy number of perhaps just one or two per cell; others, called relaxed plasmids, 
are present in multiple copies of 50 or more per cell. Generally speaking, a useful 
cloning vector needs to be present in the cell in multiple copies so that large quantities 
of the recombinant DNA molecule can be obtained. 

Figure 2.4 

Plasmid transfer by conjugation between bacterial cells. The donor and recipient cells 
attach to each other by a pilus, a hollow appendage present on the surface of the donor 
cell. A copy of the plasmid is then passed to the recipient cell. Transfer is thought to 
occur through the pilus, but this has not been proven and transfer by some other means 
(e.g. directly across the bacterial cell walls) remains a possibility. 

2.1.2 Conjugation and compatibility 

Plasmids fall into two groups: conjugative and non-conjugative. Conjugative plasmids 
are characterized by the ability to promote sexual conjugation between bacterial cells 
(Figure 2.4), a process that can result in a conjugative plasmid spreading from one cell 
to all the other cells in a bacterial culture. Conjugation and plasmid transfer are con- 
trolled by a set of transfer or tra genes, which are present on conjugative plasmids but 
absent from the non-conjugative type. However, a non-conjugative plasmid may, under 
some circumstances, be cotransferred along with a conjugative plasmid when both are 
present in the same cell. 

Several different kinds of plasmid may be found in a single cell, including more than 
one different conjugative plasmid at any one time. In fact, cells of E. coli have been 
known to contain up to seven different plasmids at once. To be able to coexist in the 
same cell, different plasmids must be compatible. If two plasmids are incompatible then 
one or the other will be rapidly lost from the cell. Different types of plasmid can there- 
fore be assigned to different incompatibility groups on the basis of whether or not they 
can coexist, and plasmids from a single incompatibility group are often related to each 
other in various ways. The basis of incompatibility is not well understood, but events 
during plasmid replication are thought to underlie the phenomenon. 

2.1.3 Plasmid classification 

The most useful classification of naturally occurring plasmids is based on the main char- 
acteristic coded by the plasmid genes. The five major types of plasmid according to this 
classification are as follows: 

• Fertility or F plasmids carry only tra genes and have no characteristic beyond 
the ability to promote conjugal transfer of plasmids. A well-known example is the 
F plasmid of E. coli. 

• Resistance or R plasmids carry genes conferring on the host bacterium resistance 
to one or more antibacterial agents, such as chloramphenicol, ampicillin, and 
mercury. R plasmids are very important in clinical microbiology as their spread 
through natural populations can have profound consequences in the treatment 
of bacterial infections. An example is RP4, which is commonly found in 
Pseudomonas, but also occurs in many other bacteria. 

• Col plasmids code for colicins, proteins that kill other bacteria. An example is 
ColE1 of E. coli. 

• Degradative plasmids allow the host bacterium to metabolize unusual molecules 
such as toluene and salicylic acid, an example being TOL of Pseudomonas putida. 

• Virulence plasmids confer pathogenicity on the host bacterium; these include the 
Ti plasmids of Agrobacterium tumefaciens, which induce crown gall disease on 
dicotyledonous plants. 

2.1.4 Plasmids in organisms other than bacteria 

Although plasmids are widespread in bacteria they are by no means as common in other 
organisms. The best characterized eukaryotic plasmid is the 2 µm circle that occurs in 
many strains of the yeast Saccharomyces cerevisiae. The discovery of the 2 µm plasmid 
was very fortuitous as it allowed the construction of cloning vectors for this very import- 
ant industrial organism (p. 105). However, the search for plasmids in other eukaryotes 
(such as filamentous fungi, plants and animals) has proved disappointing, and it is sus- 
pected that many higher organisms simply do not harbor plasmids within their cells. 

2.2 Bacteriophages 

Bacteriophages, or phages as they are commonly known, are viruses that specifically 
infect bacteria. Like all viruses, phages are very simple in structure, consisting merely of 
a DNA (or occasionally ribonucleic acid (RNA)) molecule carrying a number of genes, 
including several for replication of the phage, surrounded by a protective coat or capsid 
made up of protein molecules (Figure 2.5). 



Figure 2.5 

The two main types of phage structure: (a) head-and-tail (e.g. λ); (b) filamentous (e.g. M13). 

Figure 2.6 

The general pattern of infection of a bacterial cell by a bacteriophage. 



2.2.1 The phage infection cycle 

The general pattern of infection, which is the same for all types of phage, is a three-step 
process (Figure 2.6): 

1 The phage particle attaches to the outside of the bacterium and injects its DNA 
chromosome into the cell. 

2 The phage DNA molecule is replicated, usually by specific phage enzymes coded by 
genes in the phage chromosome. 

3 Other phage genes direct synthesis of the protein components of the capsid, and 
new phage particles are assembled and released from the bacterium. 

With some phage types the entire infection cycle is completed very quickly, possibly 
in less than 20 minutes. This type of rapid infection is called a lytic cycle, as release of 
the new phage particles is associated with lysis of the bacterial cell. The characteristic 
feature of a lytic infection cycle is that phage DNA replication is immediately followed 
by synthesis of capsid proteins, and the phage DNA molecule is never maintained in a 
stable condition in the host cell. 

2.2.2 Lysogenic phages 

In contrast to a lytic cycle, lysogenic infection is characterized by retention of the phage 
DNA molecule in the host bacterium, possibly for many thousands of cell divisions. 
With many lysogenic phages the phage DNA is inserted into the bacterial genome, in a 
manner similar to episomal insertion (see Figure 2.3b). The integrated form of the phage 
DNA (called the prophage) is quiescent, and a bacterium (referred to as a lysogen) that 
carries a prophage is usually physiologically indistinguishable from an uninfected cell. 
However, the prophage is eventually released from the host genome and the phage 
reverts to the lytic mode and lyses the cell. The infection cycle of lambda (λ), a typical 
lysogenic phage of this type, is shown in Figure 2.7. 

A limited number of lysogenic phages follow a rather different infection cycle. When 
M13 or a related phage infects E. coli, new phage particles are continuously assembled 
and released from the cell. The M13 DNA is not integrated into the bacterial genome 
and does not become quiescent. With these phages, cell lysis never occurs, and the 
infected bacterium can continue to grow and divide, albeit at a slower rate than 
uninfected cells. Figure 2.8 shows the M13 infection cycle. 

Although there are many different varieties of bacteriophage, only λ and M13 have 
found a major role as cloning vectors. We will now consider the properties of these two 
phages in more detail. 

Gene organization in the A DNA motecute 

A is a typical example of a head-and-tail phage (see Figure 2.5a). The DNA is contained 
in the polyhedral head structure and the tail serves to attach the phage to the bacterial 
surface and to inject the DNA into the cell (see Figure 2.7). 

The λ DNA molecule is 49 kb in size and has been intensively studied by the techniques 
of gene mapping and DNA sequencing. As a result the positions and identities 
of all of the genes in the λ DNA molecule are known (Figure 2.9). A feature of the λ 
genetic map is that genes related in terms of function are clustered together in the 
genome. For example, all of the genes coding for components of the capsid are grouped 
together in the left-hand third of the molecule, and genes controlling integration of the 
prophage into the host genome are clustered in the middle of the molecule. Clustering 
of related genes is profoundly important for controlling expression of the λ genome, as 
it allows genes to be switched on and off as a group rather than individually. Clustering 
is also important in the construction of λ-based cloning vectors, as we will discover 
when we return to this topic in Chapter 6. 

The linear and circular forms of λ DNA 

A second feature of λ that turns out to be of importance in the construction of cloning 
vectors is the conformation of the DNA molecule. The molecule shown in Figure 2.9 is 
linear, with two free ends, and represents the DNA present in the phage head structure. 
This linear molecule consists of two complementary strands of DNA, base-paired 
according to the Watson-Crick rules (that is, double-stranded DNA). However, at either 
end of the molecule is a short 12-nucleotide stretch in which the DNA is single-stranded 
(Figure 2.10a). The two single strands are complementary, and so can base pair with one 
another to form a circular, completely double-stranded molecule (Figure 2.10b). 

Figure 2.7 

The lysogenic infection cycle of bacteriophage λ. 



Complementary single strands are often referred to as “sticky” ends or cohesive ends, 
because base pairing between them can “stick” together the two ends of a DNA molecule 
(or the ends of two different DNA molecules). The λ cohesive ends are called the cos 
sites and they play two distinct roles during the λ infection cycle. First, they allow the 
linear DNA molecule that is injected into the cell to be circularized, which is a necessary 
prerequisite for insertion into the bacterial genome (see Figure 2.7). 

Figure 2.8 

The infection cycle of bacteriophage M13. 

Figure 2.9 

The λ genetic map, showing the positions of the important genes and the functions of the gene clusters. 



The second role of the cos sites is rather different, and comes into play after the 
prophage has excised from the host genome. At this stage a large number of new λ DNA 
molecules are produced by the rolling circle mechanism of replication (Figure 2.10c), 
in which a continuous DNA strand is “rolled off” the template molecule. The result 
is a catenane consisting of a series of linear λ genomes joined together at the cos sites. 
The role of the cos sites is now to act as recognition sequences for an endonuclease 
that cleaves the catenane at the cos sites, producing individual λ genomes. This 
endonuclease, which is the product of gene A on the λ DNA molecule, creates the 
single-stranded sticky ends, and also acts in conjunction with other proteins to package each 
λ genome into a phage head structure. The cleavage and packaging processes recognize 
just the cos sites and the DNA sequences to either side of them, so changing the 
structure of the internal regions of the λ genome, for example by inserting new genes, 
has no effect on these events so long as the overall length of the λ genome is not altered 
too greatly. 

Figure 2.10 

The linear and circular forms of λ DNA. (a) The linear form, showing the left cohesive end 
(GGGCGGCGACCT) and the right cohesive end (CCCGCCGCTGGA). (b) Base pairing 
between the cohesive ends results in the circular form of the molecule. (c) Rolling circle 
replication produces a catenane of new linear λ DNA molecules, which are individually 
packaged into phage heads as new λ particles are assembled. 
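
The claim that the two cohesive ends are complementary can be checked directly from the sequences printed in Figure 2.10a. The short Python sketch below does this. It assumes that the figure draws the left end in the 5'→3' direction and the right end beneath it in the 3'→5' direction, so that each base should pair with the one in the same position; the function names are illustrative only. 

```python
# Cohesive-end sequences as printed in Figure 2.10a.
# Assumption: the left end is read 5'->3' and the right end 3'->5',
# so bases at the same position face each other in the annealed cos site.
LEFT_COHESIVE_END = "GGGCGGCGACCT"
RIGHT_COHESIVE_END = "CCCGCCGCTGGA"

# Watson-Crick base-pairing rules for DNA.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Return the base-by-base complement of seq, without reversing it."""
    return "".join(PAIRS[base] for base in seq)


def can_anneal(top: str, bottom: str) -> bool:
    """True if every base in top pairs with the base drawn beneath it."""
    return len(top) == len(bottom) and complement(top) == bottom


if __name__ == "__main__":
    print("Length of each single-stranded end:", len(LEFT_COHESIVE_END))
    print("Ends can base pair:", can_anneal(LEFT_COHESIVE_END, RIGHT_COHESIVE_END))
```

Under the stated orientation assumption, every one of the 12 positions forms a valid base pair, which is what allows the linear molecule to circularize. 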



M13: a filamentous phage 

M13 is an example of a filamentous phage (see Figure 2.5b) and is completely different 
in structure from λ. Furthermore, the M13 DNA molecule is much smaller than the 
λ genome, being only 6407 nucleotides in length. It is circular and is unusual in that it 
consists entirely of single-stranded DNA. 

The smaller size of the M13 DNA molecule means that it has room for fewer genes 
than the λ genome. This is possible because the M13 capsid is constructed from multiple 
copies of just three proteins (requiring only three genes), whereas synthesis of the λ 
head-and-tail structure involves over 15 different proteins. In addition, M13 follows a 
simpler infection cycle than λ, and does not need genes for insertion into the host 
genome. 

Figure 2.11 

The M13 infection cycle, showing the different types of DNA replication that occur. 
(a) After infection the single-stranded M13 DNA molecule is converted into the 
double-stranded replicative form (RF). (b) The RF replicates to produce multiple copies 
of itself. (c) Single-stranded molecules are synthesized by rolling circle replication and 
used in the assembly of new M13 particles. 

Injection of an M13 DNA molecule into an E. coli cell occurs via the pilus, the 
structure that connects two cells during sexual conjugation (see Figure 2.4). Once inside 
the cell the single-stranded molecule acts as the template for synthesis of a comple- 
mentary strand, resulting in normal double-stranded DNA (Figure 2.11a). This molecule 
is not inserted into the bacterial genome, but instead replicates until over 100 copies are 
present in the cell (Figure 2.11b). When the bacterium divides, each daughter cell 
receives copies of the phage genome, which continues to replicate, thereby maintaining 
its overall numbers per cell. As shown in Figure 2.11c, new phage particles are continu- 
ously assembled and released, about 1000 new phages being produced during each 
generation of an infected cell. 

Several features of M13 make this phage attractive as a cloning vector. The genome 
is less than 10 kb in size, well within the range desirable for a potential vector. In 
addition, the double-stranded replicative form (RF) of the M13 genome behaves very much 
like a plasmid, and can be treated as such for experimental purposes. It is easily prepared 
from a culture of infected E. coli cells (p. 43) and can be reintroduced by transfection 
(p. 81). Most importantly, genes cloned with an M13-based vector can be obtained in 
the form of single-stranded DNA. Single-stranded versions of cloned genes are useful 
for several techniques, notably DNA sequencing and in vitro mutagenesis (pp. 169 and 
203). Cloning in an M13 vector is an easy and reliable way of obtaining single-stranded 
DNA for this type of work. M13 vectors are also used in phage display, a technique for 
identifying pairs of genes whose protein products interact with one another (p. 220). 

2.2.3 Viruses as cloning vectors for other organisms 

Most living organisms are infected by viruses and it is not surprising that there has been 
great interest in the possibility that viruses might be used as cloning vectors for higher 
organisms. This is especially important when it is remembered that plasmids are not 
commonly found in organisms other than bacteria and yeast. Several eukaryotic viruses 
have been employed as cloning vectors for specialized applications: for example, human 
adenoviruses are used in gene therapy (p. 259), baculoviruses are used to synthesize 
important pharmaceutical proteins in insect cells (p. 240), and caulimoviruses and 
geminiviruses have been used for cloning in plants (p. 120). These vectors are discussed 
more fully in Chapter 7. 

The genetic engineer will, at different times, need to prepare at least three distinct kinds 
of DNA. First, total cell DNA will often be required as a source of material from which 
to obtain genes to be cloned. Total cell DNA may be DNA from a culture of bacteria, 
from a plant, from animal cells, or from any other type of organism that is being studied. 
It consists of the genomic DNA of the organism along with any additional DNA molecules, 
such as plasmids, that are present. 

The second type of DNA that will be required is pure plasmid DNA. Preparation of 
plasmid DNA from a culture of bacteria follows the same basic steps as purification of 
total cell DNA, with the crucial difference that at some stage the plasmid DNA must be 
separated from the main bulk of chromosomal DNA also present in the cell. 

Finally, phage DNA will be needed if a phage cloning vector is to be used. Phage 
DNA is generally prepared from bacteriophage particles rather than from infected 
cells, so there is no problem with contaminating bacterial DNA. However, special 
techniques are needed to remove the phage capsid. An exception is the double-stranded 
replicative form of M13, which is prepared from E. coli cells in the same way as a 
bacterial plasmid. 

3.1 Preparation of total cell DNA 

The fundamentals of DNA preparation are most easily understood by first considering 
the simplest type of DNA purification procedure, that where the entire DNA complement 
of a bacterial cell is required. The modifications needed for plasmid and phage DNA 
preparation can then be described later. 

Figure 3.1 

The basic steps in preparation of total cell DNA from a culture of bacteria. 

The procedure for total DNA preparation from a culture of bacterial cells can be 
divided into four stages (Figure 3.1): 

1 A culture of bacteria is grown and then harvested. 

2 The cells are broken open to release their contents. 

3 This cell extract is treated to remove all components except the DNA. 

4 The resulting DNA solution is concentrated. 

3.1.1 Growing and harvesting a bacterial culture 

Most bacteria can be grown without too much difficulty in a liquid medium (broth 
culture). The culture medium must provide a balanced mixture of the essential nutrients 
at concentrations that will allow the bacteria to grow and divide efficiently. Two typical 
growth media are detailed in Table 3.1. 

M9 is an example of a defined medium in which all the components are known. This 
medium contains a mixture of inorganic nutrients to provide essential elements such as 
nitrogen, magnesium, and calcium, as well as glucose to supply carbon and energy. In 
practice, additional growth factors such as trace elements and vitamins must be added 
to M9 before it will support bacterial growth. Precisely which supplements are needed 
depends on the species concerned. 

Table 3.1 

The composition of two typical media for the growth of bacterial cultures. 

The second medium described in Table 3.1 is rather different. Luria-Bertani (LB) is 
a complex or undefined medium, meaning that the precise identity and quantity of its 
components are not known. This is because two of the ingredients, tryptone and yeast 
extract, are complicated mixtures of unknown chemical compounds. Tryptone in fact 
supplies amino acids and small peptides, while yeast extract (a dried preparation of 
partially digested yeast cells) provides the nitrogen requirements, along with sugars 
and inorganic and organic nutrients. Complex media such as LB need no further supple- 
mentation and support the growth of a wide range of bacterial species. 

Defined media must be used when the bacterial culture has to be grown under 
precisely controlled conditions. However, this is not necessary when the culture is being 
grown simply as a source of DNA, and under these circumstances a complex medium 
is appropriate. In LB medium at 37°C, aerated by shaking at 150-250 rpm on a rotary 
platform, E. coli cells divide once every 20 min or so until the culture reaches a 
maximum density of about 2-3 × 10^9 cells/ml. The growth of the culture can be 
monitored by reading the optical density (OD) at 600 nm (Figure 3.2), at which wavelength 
1 OD unit corresponds to about 0.8 × 10^9 cells/ml. 
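
The two figures quoted above (about 0.8 × 10^9 cells/ml per OD unit, and one division every 20 min or so) are enough for rough planning calculations. The Python sketch below is illustrative only. It assumes that OD is strictly proportional to cell density and that growth is purely exponential, ignoring the lag and stationary phases; both are simplifications, and the function names are hypothetical. 

```python
import math

# Values taken from the text for E. coli in LB medium at 37 degrees C.
CELLS_PER_ML_PER_OD = 0.8e9   # cells/ml corresponding to 1 OD unit at 600 nm
DOUBLING_TIME_MIN = 20.0      # approximate time between cell divisions


def cells_per_ml(od600: float) -> float:
    """Estimate cell density from an OD reading (assumes a linear relationship)."""
    return od600 * CELLS_PER_ML_PER_OD


def minutes_to_reach(start_cells_per_ml: float, target_cells_per_ml: float) -> float:
    """Estimate growth time between two densities (assumes unrestricted doubling)."""
    doublings = math.log2(target_cells_per_ml / start_cells_per_ml)
    return doublings * DOUBLING_TIME_MIN


if __name__ == "__main__":
    density = cells_per_ml(0.5)
    print(f"OD 0.5 is roughly {density:.1e} cells/ml")
    # Hypothetical starting density of 1e6 cells/ml, growing to 2e9 cells/ml.
    print(f"Approximate growth time: {minutes_to_reach(1e6, 2e9):.0f} min")
```

In practice the cell number is read from a calibration curve, as described in Figure 3.2, so the single conversion factor used here should be treated as an approximation. 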

In order to prepare a cell extract, the bacteria must be obtained in as small a volume 
as possible. Harvesting is therefore performed by spinning the culture in a centrifuge 
(Figure 3.3). Fairly low centrifugation speeds will pellet the bacteria at the bottom 
of the centrifuge tube, allowing the culture medium to be poured off. Bacteria from a 
1000 ml culture at maximum cell density can then be resuspended into a volume of 
10 ml or less. 

Figure 3.2 

Estimation of bacterial cell number by measurement of optical density (OD). (a) A 
sample of the culture is placed in a glass cuvette and light with a wavelength of 600 nm 
shone through. The amount of light that passes through the culture is measured and the 
OD (also called the absorbance) calculated as: 

OD = -log10 (intensity of transmitted light / intensity of incident light) 

The operation is performed with a spectrophotometer. (b) The cell number 
corresponding to the OD reading is calculated from a calibration curve. This curve is 
plotted from the OD values of a series of cultures of known cell density. For E. coli, 
1 OD unit = 0.8 × 10^9 cells/ml. 

Figure 3.3 

Harvesting bacteria by centrifugation. 

3.1.2 Preparation of a cell extract 

The bacterial cell is enclosed in a cytoplasmic membrane and surrounded by a rigid cell 
wall. With some species, including E. coli, the cell wall may itself be enveloped by a 
second, outer membrane. All of these barriers have to be disrupted to release the cell 
components. 

Techniques for breaking open bacterial cells can be divided into physical methods, 
in which the cells are disrupted by mechanical forces, and chemical methods, where cell 
lysis is brought about by exposure to chemical agents that affect the integrity of the cell 
barriers. Chemical methods are most commonly used with bacterial cells when the object 
is DNA preparation. 

Chemical lysis generally involves one agent attacking the cell wall and another 
disrupting the cell membrane (Figure 3.4a). The chemicals that are used depend on the 
species of bacterium involved, but with E. coli and related organisms, weakening of 
the cell wall is usually brought about by lysozyme, ethylenediamine tetraacetate (EDTA), 
or a combination of both. Lysozyme is an enzyme that is present in egg white and in 
secretions such as tears and saliva, and which digests the polymeric compounds that 
give the cell wall its rigidity. EDTA removes magnesium ions that are essential for 
preserving the overall structure of the cell envelope, and also inhibits cellular enzymes that 
could degrade DNA. Under some conditions, weakening the cell wall with lysozyme or 
EDTA is sufficient to cause bacterial cells to burst, but usually a detergent such as sodium 
dodecyl sulphate (SDS) is also added. Detergents aid the process of lysis by removing 
lipid molecules and thereby cause disruption of the cell membranes. 

Figure 3.4 

Preparation of a cell extract. (a) Cell lysis. (b) Centrifugation of the cell extract to remove insoluble debris. 

Having lysed the cells, the final step in preparation of a cell extract is removal of 
insoluble cell debris. Components such as partially digested cell wall fractions can be 
pelleted by centrifugation (Figure 3.4b), leaving the cell extract as a reasonably clear 
supernatant. 

3.1.3 Purification of DNA from a cell extract 

In addition to DNA, a bacterial cell extract contains significant quantities of protein 
and RNA. A variety of methods can be used to purify the DNA from this mixture. One 
approach is to treat the mixture with reagents which degrade the contaminants, 
leaving a pure solution of DNA (Figure 3.5a). Other methods use ion-exchange 
chromatography to separate the mixture into its various components, so the DNA is 
removed from the proteins and RNA in the extract (Figure 3.5b). 

Removing contaminants by organic extraction and enzyme digestion 

The standard way to deproteinize a cell extract is to add phenol or a 1:1 mixture of 
phenol and chloroform. These organic solvents precipitate proteins but leave the nucleic 
acids (DNA and RNA) in aqueous solution. The result is that if the cell extract is mixed 
gently with the solvent, and the layers then separated by centrifugation, precipitated 
protein molecules are left as a white coagulated mass at the interface between the 
aqueous and organic layers (Figure 3.6). The aqueous solution of nucleic acids can then 
be removed with a pipette. 

With some cell extracts the protein content is so great that a single phenol extraction 
is not sufficient to completely purify the nucleic acids. This problem could be solved by 
carrying out several phenol extractions one after the other, but this is undesirable as 
each mixing and centrifugation step results in a certain amount of breakage of the DNA 
molecules. The answer is to treat the cell extract with a protease such as pronase or 
proteinase K before phenol extraction. These enzymes break polypeptides down into 
smaller units, which are more easily removed by phenol. 

Figure 3.5 

Two approaches to DNA purification. (a) Treating the mixture with reagents which 
degrade the contaminants, leaving a pure solution of DNA. (b) Separating the mixture 
into different fractions, one of which is pure DNA. 

Figure 3.6 

Removal of protein contaminants by phenol extraction. 

Some RNA molecules, especially messenger RNA (mRNA), are removed by phenol 
treatment, but most remain with the DNA in the aqueous layer. The only effective way 
to remove the RNA is with the enzyme ribonuclease, which rapidly degrades these 
molecules into ribonucleotide subunits. 

Using ion-exchange chromatography to purify DNA from a cell extract 

Biochemists have devised various methods for using differences in electrical charge to 
separate mixtures of chemicals into their individual components. One of these methods 
is ion-exchange chromatography, which separates molecules according to how tightly 
they bind to electrically charged particles present in a chromatographic matrix or 
resin. DNA and RNA are both negatively charged, as are some proteins, and so bind to 
a positively charged resin. The electrical attachment is disrupted by salt (Figure 3.7a), 
removal of the more tightly bound molecules requiring higher concentrations of salt. By 
gradually increasing the salt concentration, different types of molecule can be detached 
from the resin one after another. 

The simplest way to carry out ion-exchange chromatography is to place the resin in 
a glass or plastic column and then add the cell extract to the top (Figure 3.7b). The 
extract passes through the column, and because this extract contains very little salt all 
the negatively charged molecules bind to the resin and are retained in the column. If a 
salt solution of gradually increasing concentration is now passed through the column, 
the different types of molecule will elute (i.e., become unbound) in the sequence protein, 
RNA, and finally DNA. However, such careful separation is usually not needed so just 
two salt solutions are used, one whose concentration is sufficient to elute the protein and 
RNA, leaving just the DNA bound, followed by a second of a higher concentration 
which elutes the DNA, now free from protein and RNA contaminants. 
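
The logic of the two-step elution can be sketched in a few lines of Python. Only the order of elution (protein, then RNA, then DNA) comes from the text; the salt concentrations below are hypothetical values chosen purely for illustration, and real thresholds depend on the resin and buffer used. 

```python
# Hypothetical salt concentrations (in M) at which each class of molecule
# detaches from the resin. Only the ORDER protein < RNA < DNA is from the
# text; the numbers themselves are illustrative assumptions.
ELUTION_THRESHOLD_M = {"protein": 0.3, "RNA": 0.6, "DNA": 1.0}


def step_elution(bound, salt_steps_m):
    """Pass salt solutions of increasing concentration through the column.

    Returns a list of (salt concentration, molecules eluted at that step).
    A molecule elutes at the first step whose salt concentration reaches
    its threshold, and is then no longer bound to the resin.
    """
    remaining = set(bound)
    fractions = []
    for salt in salt_steps_m:
        eluted = sorted(m for m in remaining if ELUTION_THRESHOLD_M[m] <= salt)
        remaining -= set(eluted)
        fractions.append((salt, eluted))
    return fractions


if __name__ == "__main__":
    # Two-step procedure: the first wash removes protein and RNA,
    # the second, more concentrated wash releases the purified DNA.
    for salt, eluted in step_elution(["protein", "RNA", "DNA"], [0.8, 1.2]):
        print(f"{salt} M salt elutes: {eluted}")
```

With these illustrative thresholds the first wash carries away the protein and RNA and the second wash contains only DNA, which is the outcome the two-solution procedure is designed to give. 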

3.1.4 Concentration of DNA samples 

Organic extraction often results in a very thick solution of DNA that does not need to 
be concentrated any further. Other purification methods give more dilute solutions and 
it is therefore important to consider methods for increasing the DNA concentration. 

Figure 3.7 

DNA purification by ion-exchange chromatography. (a) Attachment of DNA to 
ion-exchange particles. (b) DNA is purified by column chromatography. The solutions 
passing through the column can be collected by gravity flow or by the spin column 
method, in which the column is placed in a low-speed centrifuge. 

The most frequently used method of concentration is ethanol precipitation. In the 
presence of salt (strictly speaking, monovalent cations such as sodium ions (Na+)), and 
at a temperature of -20°C or less, absolute ethanol efficiently precipitates polymeric 
nucleic acids. With a thick solution of DNA the ethanol can be layered on top of the 
sample, causing molecules to precipitate at the interface. A spectacular trick is to 
push a glass rod through the ethanol into the DNA solution. When the rod is removed, 
DNA molecules adhere and can be pulled out of the solution in the form of a long fiber 
(Figure 3.8a). Alternatively, if ethanol is mixed with a dilute DNA solution, the 
precipitate can be collected by centrifugation (Figure 3.8b), and then redissolved in an appropriate 
volume of water. Ethanol precipitation has the added advantage of leaving short-chain 
and monomeric nucleic acid components in solution. Ribonucleotides produced by 
ribonuclease treatment are therefore lost at this stage. 
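
The arithmetic behind concentrating a dilute sample is simple and can be sketched as follows. The ratio of 2.5 volumes of absolute ethanol to 1 volume of DNA solution is the one given in the caption to Figure 3.8; the `recovery` parameter is a hypothetical addition, included only to make the assumption of complete recovery explicit. 

```python
ETHANOL_VOLUMES_PER_SAMPLE = 2.5  # ratio quoted in the caption to Figure 3.8


def ethanol_to_add_ml(sample_ml: float) -> float:
    """Volume of absolute ethanol to mix with a dilute DNA solution."""
    return ETHANOL_VOLUMES_PER_SAMPLE * sample_ml


def concentration_after_redissolving(
    initial_ug_per_ml: float,
    initial_ml: float,
    final_ml: float,
    recovery: float = 1.0,  # hypothetical: fraction of DNA recovered in the pellet
) -> float:
    """DNA concentration after the pellet is redissolved in final_ml of water."""
    total_ug = initial_ug_per_ml * initial_ml * recovery
    return total_ug / final_ml


if __name__ == "__main__":
    sample_ml = 0.4
    print(f"Add {ethanol_to_add_ml(sample_ml):.2f} ml ethanol to {sample_ml} ml sample")
    # Illustrative numbers: 10 ug/ml in 0.4 ml, redissolved in 0.05 ml of water.
    final = concentration_after_redissolving(10.0, 0.4, 0.05)
    print(f"Concentration after redissolving: {final:.0f} ug/ml")
```

The sample volume and starting concentration in the usage example are illustrative, and in a real experiment some DNA is lost at each step, so the calculated figure is an upper limit. 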

3.1.5 Measurement of DNA concentration 

It is crucial to know exactly how much DNA is present in a solution when carrying out 
a gene cloning experiment. Fortunately DNA concentrations can be accurately measured 
by ultraviolet (UV) absorbance spectrophotometry. The amount of UV radiation 
absorbed by a solution of DNA is directly proportional to the amount of DNA in the 
sample. Usually absorbance is measured at 260 nm, at which wavelength an absorbance 
(A260) of 1.0 corresponds to 50 µg of double-stranded DNA per ml. Measurements of 
as little as 1 µl of a DNA solution can be carried out in spectrophotometers designed 
especially for this purpose. 

Figure 3.8 

Collecting DNA by ethanol precipitation. (a) Absolute ethanol is layered on top of a 
concentrated solution of DNA. Fibers of DNA can be withdrawn with a glass rod. 
(b) For less concentrated solutions ethanol is added (at a ratio of 2.5 volumes of 
absolute ethanol to 1 volume of DNA solution) and precipitated DNA collected by 
centrifugation. 

Ultraviolet absorbance can also be used to check the purity of a DNA preparation. 
With a pure sample of DNA, the ratio of the absorbances at 260 and 280 nm (A260/A280) 
is 1.8. Ratios of less than 1.8 indicate that the preparation is contaminated, either with 
protein or with phenol. 
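
Both calculations can be expressed in a few lines of Python. This is a minimal sketch using the two figures given above (an A260 of 1.0 for 50 µg of double-stranded DNA per ml, and a ratio of 1.8 for a pure sample). The `dilution_factor` and `tolerance` parameters are hypothetical additions, included to allow for a diluted sample and for measurement noise. 

```python
UG_PER_ML_PER_A260 = 50.0   # double-stranded DNA, from the text
PURE_DNA_RATIO = 1.8        # expected A260/A280 for pure DNA, from the text


def dna_concentration_ug_per_ml(a260: float, dilution_factor: float = 1.0) -> float:
    """Concentration of double-stranded DNA in the undiluted sample."""
    return a260 * UG_PER_ML_PER_A260 * dilution_factor


def purity_ratio(a260: float, a280: float) -> float:
    """Ratio of the absorbances at 260 and 280 nm."""
    return a260 / a280


def looks_contaminated(a260: float, a280: float, tolerance: float = 0.05) -> bool:
    """True if the ratio falls clearly below 1.8 (tolerance is an assumed margin)."""
    return purity_ratio(a260, a280) < PURE_DNA_RATIO - tolerance


if __name__ == "__main__":
    # Illustrative readings for a sample diluted 10-fold before measurement.
    a260, a280 = 0.5, 0.33
    print(f"DNA concentration: {dna_concentration_ug_per_ml(a260, 10):.0f} ug/ml")
    print(f"A260/A280 ratio: {purity_ratio(a260, a280):.2f}")
    print("Possible protein or phenol contamination:", looks_contaminated(a260, a280))
```

The readings in the usage example are invented for illustration. A low ratio indicates that contamination is present but does not distinguish protein from phenol. 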

3.1.6 Other methods for the preparation of total cell DNA 

Bacteria are not the only organisms from which DNA may be required. Total cell DNA 
from, for example, plants or animals will be needed if the aim of the genetic engineering 
project is to clone genes from these organisms. Although the basic steps in DNA 
purification are the same whatever the organism, some modifications may have to be 
introduced to take account of the special features of the cells being used. 

Obviously growth of cells in liquid medium is appropriate only for bacteria, other 
microorganisms, and plant and animal cell cultures. The major modifications, however, 
are likely to be needed at the cell breakage stage. The chemicals used for disrupting 
bacterial cells do not usually work with other organisms: lysozyme, for example, has no 
effect on plant cells. Specific degradative enzymes are available for most cell wall types, 
but often physical techniques, such as grinding frozen material with a mortar and 
pestle, are more efficient. On the other hand, most animal cells have no cell wall at all, 
and can be lysed simply by treating with detergent. 

Another important consideration is the biochemical content of the cells from which 
DNA is being extracted. With most bacteria the main biochemicals present in a cell 
extract are protein, DNA and RNA, so phenol extraction and/or protease treatment, 
followed by removal of RNA with ribonuclease, leaves a pure DNA sample. These 
treatments may not, however, be sufficient to give pure DNA if the cells also contain 
significant quantities of other biochemicals. Plant tissues are particularly difficult in this 
respect as they often contain large amounts of carbohydrates that are not removed by 
phenol extraction. Instead a different approach must be used. One method makes use 
of a detergent called cetyltrimethylammonium bromide (CTAB), which forms an 
insoluble complex with nucleic acids. When CTAB is added to a plant cell extract the 
nucleic acid-CTAB complex precipitates, leaving carbohydrate, protein, and other 
contaminants in the supernatant (Figure 3.9). The precipitate is then collected by 
centrifugation and resuspended in 1 M sodium chloride, which causes the complex to 
break down. The nucleic acids can now be concentrated by ethanol precipitation and 
the RNA removed by ribonuclease treatment. 

Figure 3.9 

The CTAB method for purification of plant DNA. 

The need to adapt organic extraction methods to take account of the biochemical 
contents of different types of starting material has stimulated the search for DNA 
purification methods that can be used with any species. This is one of the reasons why 
ion-exchange chromatography has become so popular. A similar method involves 
a compound called guanidinium thiocyanate, which has two properties that make it 
useful for DNA purification. First, it denatures and dissolves all biochemicals other 
than nucleic acids and can therefore be used to release DNA from virtually any type of 
cell or tissue. Second, in the presence of guanidinium thiocyanate, DNA binds tightly 
to silica particles (Figure 3.10a). This provides an easy way of recovering the DNA from 
the denatured mix of biochemicals. One possibility is to add the silica directly to the 
cell extract but, as with the ion-exchange methods, it is more convenient to use a 
chromatography column. The silica is placed in the column and the cell extract added 
(Figure 3.10b). DNA binds to the silica and is retained in the column, whereas the 
denatured biochemicals pass straight through. After washing away the last contaminants 
with guanidinium thiocyanate solution, the DNA is recovered by adding water, which 
destabilizes the interactions between the DNA molecules and the silica. 

3.2 Preparation of plasmid DNA 

Purification of plasmids from a culture of bacteria involves the same general strategy as 
preparation of total cell DNA. A culture of cells, containing plasmids, is grown in liquid 
medium, harvested, and a cell extract prepared. The protein and RNA are removed, 
and the DNA probably concentrated by ethanol precipitation. However, there is an 
important distinction between plasmid purification and preparation of total cell DNA. 
In a plasmid preparation it is always necessary to separate the plasmid DNA from the 
large amount of bacterial chromosomal DNA that is also present in the cells. 

Separating the two types of DNA can be very difficult, but is nonetheless essential if 
the plasmids are to be used as cloning vectors. The presence of the smallest amount of 
contaminating bacterial DNA in a gene cloning experiment can easily lead to undesirable 
results. Fortunately several methods are available for removal of bacterial DNA during 
plasmid purification, and the use of these methods, individually or in combination, can 
result in isolation of very pure plasmid DNA. 

Figure 3.10 

DNA purification by the guanidinium thiocyanate and silica method. (a) In the presence 
of guanidinium thiocyanate, DNA binds to silica particles. (b) DNA is purified by 
column chromatography. 

The methods are based on the several physical differences between plasmid DNA 
and bacterial DNA, the most obvious of which is size. The largest plasmids are only 8% 
of the size of the E. coli chromosome, and most are much smaller than this. Techniques 
that can separate small DNA molecules from large ones should therefore effectively 
purify plasmid DNA. 

In addition to size, plasmids and bacterial DNA differ in conformation. When applied 
to a polymer such as DNA, the term conformation refers to the overall spatial configura- 
tion of the molecule, with the two simplest conformations being linear and circular. 
Plasmids and the bacterial chromosome are circular, but during preparation of the cell 
extract the chromosome is always broken to give linear fragments. A method for 
separating circular from linear molecules will therefore result in pure plasmids. 

3.2.1 Separation on the basis of size 

The usual stage at which size fractionation is performed is during preparation of the cell 
extract. If the cells are lysed under very carefully controlled conditions, only a minimal 
amount of chromosomal DNA breakage occurs. The resulting DNA fragments are still 
very large — much larger than the plasmids — and can be removed with the cell debris 
by centrifugation. This process is aided by the fact that the bacterial chromosome is 
physically attached to the cell envelope, so fragments of the chromosome sediment with 
the cell debris if these attachments are not broken. 

Cell disruption must therefore be carried out very gently to prevent wholesale break- 
age of the bacterial DNA. For E. coli and related species, controlled lysis is performed 
as shown in Figure 3.11. Treatment with EDTA and lysozyme is carried out in the 
presence of sucrose, which prevents the cells from bursting immediately. Instead, 
sphaeroplasts are formed, cells with partially degraded cell walls that retain an 
intact cytoplasmic membrane. Cell lysis is now induced by adding a non-ionic detergent 
such as Triton X-100 (ionic detergents, such as SDS, cause chromosomal breakage). 
This method causes very little breakage of the bacterial DNA, so centrifugation leaves 
a cleared lysate, consisting almost entirely of plasmid DNA. 

A cleared lysate will, however, invariably retain some chromosomal DNA. Furthermore, 
if the plasmids themselves are large molecules, they may also sediment with the 
cell debris. Size fractionation is therefore rarely sufficient on its own, and we must 
consider alternative ways of removing the bacterial DNA contaminants. 

Figure 3.11 

Preparation of a cleared lysate. 

Figure 3.12 

Two conformations of circular double-stranded DNA: (a) supercoiled, in which both 
strands are intact; (b) open-circular, in which one or both strands are nicked. 

3.2.2 Separation on the basis of conformation 

Before considering the ways in which conformational differences between plasmids and 
bacterial DNA can be used to separate the two types of DNA, we must look more closely 
at the overall structure of plasmid DNA. It is not strictly correct to say that plasmids 
have a circular conformation, because double-stranded DNA circles can take up one 
of two quite distinct configurations. Most plasmids exist in the cell as supercoiled 
molecules (Figure 3.12a). Supercoiling occurs because the double helix of the plasmid 
DNA is partially unwound during the plasmid replication process by enzymes called 
topoisomerases (p. 69). The supercoiled conformation can be maintained only if both 
polynucleotide strands are intact, hence the more technical name of covalently closed- 
circular (ccc) DNA. If one of the polynucleotide strands is broken the double helix 
reverts to its normal relaxed state, and the plasmid takes on the alternative conforma- 
tion, called open-circular (oc) (Figure 3.12b). 

Supercoiling is important in plasmid preparation because supercoiled molecules 
can be fairly easily separated from non-supercoiled DNA. Two different methods are 
commonly used. Both can purify plasmid DNA from crude cell extracts, although in 
practice best results are obtained if a cleared lysate is first prepared. 

Alkaline denaturation 

The basis of this technique is that there is a narrow pH range at which non-supercoiled 
DNA is denatured, whereas supercoiled plasmids are not. If sodium hydroxide is added 
to a cell extract or cleared lysate, so that the pH is adjusted to 12.0-12.5, then the 
hydrogen bonding in non-supercoiled DNA molecules is broken, causing the double 
helix to unwind and the two polynucleotide chains to separate (Figure 3.13). If acid is 
now added, these denatured bacterial DNA strands reaggregate into a tangled mass. 
The insoluble network can be pelleted by centrifugation, leaving plasmid DNA in the 
supernatant. An additional advantage of this procedure is that, under some circum- 
stances (specifically cell lysis by SDS and neutralization with sodium acetate), most of 
the protein and RNA also becomes insoluble and can be removed by the centrifugation 
step. Further purification by organic extraction or column chromatography may there- 
fore not be needed if the alkaline denaturation method is used. 

Ethidium bromide-caesium chloride density gradient centrifugation 

This is a specialized version of the more general technique of equilibrium or density
gradient centrifugation. A density gradient is produced by centrifuging a solution of
caesium chloride (CsCl) at a very high speed (Figure 3.14a). Macromolecules present
in the CsCl solution when it is centrifuged form bands at distinct points in the gradient
(Figure 3.14b). Exactly where a particular molecule bands depends on its buoyant
density: DNA has a buoyant density of about 1.70 g/cm³, and therefore migrates to the
point in the gradient where the CsCl density is also 1.70 g/cm³. In contrast, protein
molecules have much lower buoyant densities, and so float at the top of the tube,
whereas RNA forms a pellet at the bottom (Figure 3.14b). Density gradient centrifugation
can therefore separate DNA, RNA, and protein and is an alternative to organic
extraction or column chromatography for DNA purification.

Figure 3.13
Plasmid purification by the alkaline denaturation method.

Figure 3.14
Caesium chloride density gradient centrifugation. (a) A CsCl density gradient produced
by high speed centrifugation. (b) Separation of protein, DNA, and RNA in a density
gradient.

More importantly, density gradient centrifugation in the presence of ethidium bromide
(EtBr) can be used to separate supercoiled DNA from non-supercoiled molecules.
Ethidium bromide binds to DNA molecules by intercalating between adjacent base pairs,
causing partial unwinding of the double helix (Figure 3.15). This unwinding results in
a decrease in the buoyant density, by as much as 0.125 g/cm³ for linear DNA. However,
supercoiled DNA, with no free ends, has very little freedom to unwind, and can only
bind a limited amount of EtBr. The decrease in buoyant density of a supercoiled
molecule is therefore much less, only about 0.085 g/cm³. As a consequence, supercoiled
molecules form a band in an EtBr-CsCl gradient at a different position to linear and
open-circular DNA (Figure 3.16a).
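
The arithmetic behind this separation can be laid out in a few lines of Python. This is only an illustrative sketch: it uses the three figures quoted above and assumes that linear DNA shows the full 0.125 g/cm³ decrease, which the text gives as an upper limit. The function and variable names are invented for the example.

```python
# Buoyant density figures quoted in the text (g/cm^3).
DNA_DENSITY = 1.70        # DNA with no EtBr bound
DROP_LINEAR = 0.125       # maximum decrease for linear DNA (assumed to apply in full)
DROP_SUPERCOILED = 0.085  # approximate decrease for supercoiled DNA


def banding_density(initial: float, decrease: float) -> float:
    """Return the buoyant density after EtBr binding, rounded to 3 places."""
    return round(initial - decrease, 3)


linear = banding_density(DNA_DENSITY, DROP_LINEAR)
supercoiled = banding_density(DNA_DENSITY, DROP_SUPERCOILED)
separation = round(supercoiled - linear, 3)

print(f"linear/open-circular DNA: {linear} g/cm^3")
print(f"supercoiled DNA:          {supercoiled} g/cm^3")
print(f"difference:               {separation} g/cm^3")
```

Under these assumptions the supercoiled molecules remain the denser of the two species, so they band at a point of higher CsCl density than the linear and open-circular DNA.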

Ethidium bromide-caesium chloride density gradient centrifugation is a very efficient
method for obtaining pure plasmid DNA. When a cleared lysate is subjected to this
procedure, plasmids band at a distinct point, separated from the linear bacterial DNA,
with the protein floating on the top of the gradient and RNA pelleted at the bottom. The
position of the DNA bands can be seen by shining ultraviolet radiation on the tube, which
causes the bound EtBr to fluoresce. The pure plasmid DNA is removed by puncturing
the side of the tube and withdrawing a sample with a syringe (Figure 3.16b). The EtBr
bound to the plasmid DNA is extracted with n-butanol (Figure 3.16c) and the CsCl
removed by dialysis (Figure 3.16d). The resulting plasmid preparation is virtually 100%
pure and ready for use as a cloning vector.

Figure 3.15
Partial unwinding of the DNA double helix by EtBr intercalation between adjacent base
pairs. The normal DNA molecule shown on the left is partially unwound by taking up four
EtBr molecules, resulting in the “stretched” structure on the right.

Figure 3.16
Purification of plasmid DNA by EtBr-CsCl density gradient centrifugation. (a) An
EtBr-CsCl density gradient. (b) Removal of the DNA band. (c) Extraction of the EtBr
with n-butanol. (d) Removal of the CsCl by dialysis.

Figure 3.17
Plasmid amplification.

3.2.3 Plasmid amplification 

Preparation of plasmid DNA can be hindered by the fact that plasmids make up only 
a small proportion of the total DNA in the bacterial cell. The yield of DNA from a 
bacterial culture may therefore be disappointingly low. Plasmid amplification offers a 
means of increasing this yield. 

The aim of amplification is to increase the copy number of a plasmid. Some multi- 
copy plasmids (those with copy numbers of 20 or more) have the useful property of 
being able to replicate in the absence of protein synthesis. This contrasts with the main 
bacterial chromosome, which cannot replicate under these conditions. This property 
can be utilized during the growth of a bacterial culture for plasmid DNA purification. 
After a satisfactory cell density has been reached, an inhibitor of protein synthesis 
(e.g., chloramphenicol) is added, and the culture incubated for a further 12 hours. 
During this time the plasmid molecules continue to replicate, even though chromosome 
replication and cell division are blocked (Figure 3.17). The result is that plasmid copy 
numbers of several thousand may be attained. Amplification is therefore a very efficient 
way of increasing the yield of multicopy plasmids. 

3.3 Preparation of bacteriophage DNA 

The key difference between phage DNA purification and the preparation of either total 
cell DNA or plasmid DNA is that for phages the starting material is not normally a cell 
extract. This is because bacteriophage particles can be obtained in large numbers 
from the extracellular medium of an infected bacterial culture. When such a culture 
is centrifuged, the bacteria are pelleted, leaving the phage particles in suspension 
(Figure 3.18). The phage particles are then collected from the suspension and their DNA 
extracted by a single deproteinization step to remove the phage capsid. 

The Basic Principles of Gene Cloning and DNA Analysis

Figure 3.18
Preparation of a phage suspension from an infected culture of bacteria.

This overall process is more straightforward than the procedure used to prepare total
cell or plasmid DNA. Nevertheless, successful purification of significant quantities of
phage DNA is subject to several pitfalls. The main difficulty, especially with λ, is growing
an infected culture in such a way that the extracellular phage titer (the number of phage
particles per ml of culture) is sufficiently high. In practical terms, the maximum titer
that can reasonably be expected for λ is 10¹⁰ per ml; yet 10¹⁰ λ particles will yield only
500 ng of DNA. Large culture volumes, in the range of 500-1000 ml, are therefore
needed if substantial quantities of λ DNA are to be obtained.
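
The reasoning behind these culture volumes is simple proportion, and can be sketched in Python as follows. The only inputs are the figures quoted above (a titer of 10¹⁰ per ml and 500 ng of DNA per 10¹⁰ particles). The function name is invented for the example, and the calculation assumes, optimistically, that every phage particle in the culture is recovered.

```python
# Figure from the text: 10^10 lambda particles yield about 500 ng of DNA.
NG_PER_PARTICLE = 500 / 1e10  # ng of DNA per phage particle


def dna_yield_ug(titer_per_ml: float, volume_ml: float) -> float:
    """Estimated DNA yield in micrograms, assuming every particle is recovered."""
    particles = titer_per_ml * volume_ml
    return particles * NG_PER_PARTICLE / 1000  # convert ng to micrograms


for volume in (5, 500, 1000):
    print(f"{volume:>5} ml at 1e10 per ml: {dna_yield_ug(1e10, volume):.1f} ug")
```

On these figures a 5 ml culture would give only a few micrograms of DNA, whereas a 500 ml culture would give about 250 micrograms, which is why the larger volumes are recommended.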

3.3.1 Growth of cultures to obtain a high λ titer

Growing a large volume culture is no problem (bacterial cultures of 100 liters and over
are common in biotechnology), but obtaining the maximum phage titer requires a certain
amount of skill. The naturally occurring λ phage is lysogenic (p. 19), and an infected
culture consists mainly of cells carrying the prophage integrated into the bacterial DNA
(see Figure 2.7). The extracellular λ titer is extremely low under these circumstances.

To get a high yield of extracellular λ, the culture must be induced, so that all the
bacteria enter the lytic phase of the infection cycle, resulting in cell death and release of
λ particles into the medium. Induction is normally very difficult to control, but most
laboratory strains of λ carry a temperature-sensitive (ts) mutation in the cI gene. This is
one of the genes that are responsible for maintaining the phage in the integrated state.
If inactivated by a mutation, the cI gene no longer functions correctly and the switch to
lysis occurs. In the cIts mutation, the cI gene is functional at 30°C, at which temperature
normal lysogeny can occur. But at 42°C, the cIts gene product does not work properly,
and lysogeny cannot be maintained. A culture of E. coli infected with λ phages carrying
the cIts mutation can therefore be induced to produce extracellular phages by transferring
from 30°C to 42°C (Figure 3.19).

3.3.2 Preparation of non-lysogenic λ phages

Although most λ strains are lysogenic, many cloning vectors derived from λ are
modified, by deletions of the cI and other genes, so that lysogeny never occurs. These
phages cannot integrate into the bacterial genome and can infect cells only by a lytic
cycle (p. 18).

With these phages the key to obtaining a high titer lies in the way in which the
culture is grown, in particular the stage at which the cells are infected by adding phage
particles. If phages are added before the cells are dividing at their maximal rate, then all
the cells are lysed very quickly, resulting in a low titer (Figure 3.20a). On the other
hand, if the cell density is too high when the phages are added, then the culture will
never be completely lysed, and again the phage titer will be low (Figure 3.20b). The
ideal situation is when the age of the culture, and the size of the phage inoculum, are
balanced such that the culture continues to grow, but eventually all the cells are infected
and lysed (Figure 3.20c). As can be imagined, skill and experience are needed to judge
the matter to perfection.

Figure 3.19
Induction of a λ cIts lysogen by transferring from 30°C to 42°C.

Figure 3.20
Achieving the right balance between culture age and inoculum size when preparing a
sample of a non-lysogenic phage.

Figure 3.21
Collection of phage particles by polyethylene glycol (PEG) precipitation.

3.3.3 Collection of phages from an infected culture 

The remains of lysed bacterial cells, along with any intact cells that are inadvertently left 
over, can be removed from an infected culture by centrifugation, leaving the phage 
particles in suspension (see Figure 3.18). The problem now is to reduce the size of the 
suspension to 5 ml or less, a manageable size for DNA extraction. 

Phage particles are so small that they are pelleted only by very high speed centrifuga- 
tion. Collection of phages is therefore usually achieved by precipitation with polyethy- 
lene glycol (PEG). This is a long-chain polymeric compound which, in the presence of 
salt, absorbs water, thereby causing macromolecular assemblies such as phage particles 
to precipitate. The precipitate can then be collected by centrifugation, and redissolved 
in a suitably small volume (Figure 3.21). 

3.3.4 Purification of DNA from λ phage particles

Deproteinization of the redissolved PEG precipitate is sometimes sufficient to extract
pure phage DNA, but usually λ phages are subjected to an intermediate purification
step. This is necessary because the PEG precipitate also contains a certain amount of
bacterial debris, possibly including unwanted cellular DNA. These contaminants can be
separated from the λ particles by CsCl density gradient centrifugation. The λ particles
band in a CsCl gradient at 1.45-1.50 g/cm³ (Figure 3.22), and can be withdrawn from
the gradient as described previously for DNA bands (see Figure 3.16). Removal of CsCl
by dialysis leaves a pure phage preparation from which the DNA can be extracted by
either phenol extraction or protease treatment to remove the phage protein coat.

Figure 3.22
Purification of λ phage particles by CsCl density gradient centrifugation.



3.3.5 Purification of M13 DNA causes few problems

Most of the differences between the M13 and λ infection cycles are to the advantage
of the molecular biologist wishing to prepare M13 DNA. First, the double-stranded
replicative form of M13 (p. 23), which behaves like a high copy number plasmid, is
very easily purified by the standard procedures for plasmid preparation. A cell extract
is prepared from cells infected with M13, and the replicative form separated from
bacterial DNA by, for example, EtBr-CsCl density gradient centrifugation.

However, the single-stranded form of the M13 genome, contained in the extracellular
phage particles, is frequently required. In this respect the big advantage compared
with λ is that high titers of M13 are very easy to obtain. As infected cells continually
secrete M13 particles into the medium (see Figure 2.8), with lysis never occurring, a
high M13 titer is achieved simply by growing the infected culture to a high cell density.
In fact, titers of 10¹² per ml and above are quite easy to obtain without any special tricks
being used. Such high titers mean that significant amounts of single-stranded M13 DNA
can be prepared from cultures of small volume, 5 ml or less. Furthermore, as the
infected cells are not lysed, there is no problem with cell debris contaminating the phage
suspension. Consequently the CsCl density gradient centrifugation step, needed for λ
phage preparation, is rarely required with M13.

In summary, single-stranded M13 DNA preparation involves growth of a small 
volume of infected culture, centrifugation to pellet the bacteria, precipitation of the 
phage particles with PEG, phenol extraction to remove the phage protein coats, and 
ethanol precipitation to concentrate the resulting DNA (Figure 3.23). 

Figure 3.23
Preparation of single-stranded M13 DNA from an infected culture of bacteria.

Once pure samples of DNA have been prepared, the next step in a gene cloning experiment
is construction of the recombinant DNA molecule (see Figure 1.1). To produce
this recombinant molecule, the vector, as well as the DNA to be cloned, must be cut at 
specific points and then joined together in a controlled manner. Cutting and joining are 
two examples of DNA manipulative techniques, a wide variety of which have been 
developed over the past few years. As well as being cut and joined, DNA molecules can 
be shortened, lengthened, copied into RNA or into new DNA molecules, and modified 
by the addition or removal of specific chemical groups. These manipulations, all of 
which can be carried out in the test tube, provide the foundation not only for gene 
cloning, but also for studies of DNA biochemistry, gene structure, and the control of 
gene expression. 

Almost all DNA manipulative techniques make use of purified enzymes. Within the cell 
these enzymes participate in essential processes such as DNA replication and transcrip- 
tion, breakdown of unwanted or foreign DNA (e.g., invading virus DNA), repair of 
mutated DNA, and recombination between different DNA molecules. After purifica- 
tion from cell extracts, many of these enzymes can be persuaded to carry out their 
natural reactions, or something closely related to them, under artificial conditions. 
Although these enzymatic reactions are often straightforward, most are absolutely 
impossible to perform by standard chemical methods. Purified enzymes are therefore 
crucial to genetic engineering and an important industry has sprung up around their 
preparation, characterization, and marketing. Commercial suppliers of high purity 
enzymes provide an essential service to the molecular biologist. 

The cutting and joining manipulations that underlie gene cloning are carried out by
enzymes called restriction endonucleases (for cutting) and ligases (for joining). Most of 
this chapter will be concerned with the ways in which these two types of enzyme are 
used. First, however, we must consider the whole range of DNA manipulative enzymes, 
to see exactly what types of reaction can be performed. Many of these enzymes will be 
mentioned in later chapters when procedures that make use of them are described. 

4.1 The range of DNA manipulative enzymes 

DNA manipulative enzymes can be grouped into four broad classes, depending on the 
type of reaction that they catalyze: 

• Nucleases are enzymes that cut, shorten, or degrade nucleic acid molecules. 

• Ligases join nucleic acid molecules together. 

• Polymerases make copies of molecules. 

• Modifying enzymes remove or add chemical groups. 

Before considering in detail each of these classes of enzyme, two points should be 
made. The first is that, although most enzymes can be assigned to a particular class, a 
few display multiple activities that span two or more classes. Most importantly, many 
polymerases combine their ability to make new DNA molecules with an associated DNA 
degradative (i.e., nuclease) activity. 

Second, it should be appreciated that, as well as the DNA manipulative enzymes, 
many similar enzymes able to act on RNA are known. The ribonuclease used to remove 
contaminating RNA from DNA preparations (p. 30) is an example of such an enzyme. 
Although some RNA manipulative enzymes have applications in gene cloning and will 
be mentioned in later chapters, we will in general restrict our thoughts to those enzymes 
that act on DNA. 

4.1.1 Nucleases

Nucleases degrade DNA molecules by breaking the phosphodiester bonds that link one 
nucleotide to the next in a DNA strand. There are two different kinds of nuclease 
(Figure 4.1): 

• Exonucleases remove nucleotides one at a time from the end of a DNA molecule.

• Endonucleases are able to break internal phosphodiester bonds within a DNA molecule.

The main distinction between different exonucleases lies in the number of strands
that are degraded when a double-stranded molecule is attacked. The enzyme called
Bal31 (purified from the bacterium Alteromonas espejiana) is an example of an exonuclease
that removes nucleotides from both strands of a double-stranded molecule
(Figure 4.2a). The greater the length of time that Bal31 is allowed to act on a group of
DNA molecules, the shorter the resulting DNA fragments will be. In contrast, enzymes
such as E. coli exonuclease III degrade just one strand of a double-stranded molecule,
leaving single-stranded DNA as the product (Figure 4.2b).

The same criterion can be used to classify endonucleases. S1 endonuclease (from
the fungus Aspergillus oryzae) only cleaves single strands (Figure 4.3a), whereas
deoxyribonuclease I (DNase I), which is prepared from cow pancreas, cuts both single-
and double-stranded molecules (Figure 4.3b). DNase I is non-specific in that it attacks
DNA at any internal phosphodiester bond, so the end result of prolonged DNase I
action is a mixture of mononucleotides and very short oligonucleotides. On the other
hand, the special group of enzymes called restriction endonucleases cleave double-stranded
DNA only at a limited number of specific recognition sites (Figure 4.3c). These
important enzymes are described in detail on p. 50.

Figure 4.1
The reactions catalyzed by the two different kinds of nuclease. (a) An exonuclease, which
removes nucleotides from the end of a DNA molecule. (b) An endonuclease, which breaks
internal phosphodiester bonds.

Figure 4.2
The reactions catalyzed by different types of exonuclease. (a) Bal31, which removes
nucleotides from both strands of a double-stranded molecule. (b) Exonuclease III, which
removes nucleotides only from the 3' terminus.

4.1.2 Ligases

In the cell the function of DNA ligase is to repair single-stranded breaks (“discontinuities”)
that arise in double-stranded DNA molecules during, for example, DNA replication.
DNA ligases from most organisms can also join together two individual fragments of
double-stranded DNA (Figure 4.4). The role of these enzymes in construction of recombinant
DNA molecules is described on p. 63.

Figure 4.3
The reactions catalyzed by different types of endonuclease. (a) S1 nuclease, which
cleaves only single-stranded DNA, including single-stranded nicks in mainly
double-stranded molecules. (b) DNase I, which cleaves both single- and double-stranded
DNA. (c) A restriction endonuclease, which cleaves double-stranded DNA, but only at
a limited number of sites.

Figure 4.4
The two reactions catalyzed by DNA ligase. (a) Repair of a discontinuity, a missing
phosphodiester bond in one strand of a double-stranded molecule. (b) Joining two
molecules together.

4.1.3 Polymerases

DNA polymerases are enzymes that synthesize a new strand of DNA complementary to 
an existing DNA or RNA template (Figure 4.5a). Most polymerases can function only 
if the template possesses a double-stranded region that acts as a primer for initiation of 
polymerization. 

Four types of DNA polymerase are used routinely in genetic engineering. The first is 
DNA polymerase I, which is usually prepared from E. coli. This enzyme attaches to a 
short single-stranded region (or nick) in a mainly double-stranded DNA molecule, and 
then synthesizes a completely new strand, degrading the existing strand as it proceeds 
(Figure 4.5b). DNA polymerase I is therefore an example of an enzyme with a dual 
activity — DNA polymerization and DNA degradation. 

The polymerase and nuclease activities of DNA polymerase I are controlled by different
parts of the enzyme molecule. The nuclease activity is contained in the first 323
amino acids of the polypeptide, so removal of this segment leaves a modified enzyme
that retains the polymerase function but is unable to degrade DNA. This modified
enzyme, called the Klenow fragment, can still synthesize a complementary DNA strand
on a single-stranded template, but as it has no nuclease activity it cannot continue the
synthesis once the nick is filled in (Figure 4.5c). Several other enzymes, both natural
polymerases and modified versions, have similar properties to the Klenow fragment. The
major application of these polymerases is in DNA sequencing (p. 166).

Figure 4.5
The reactions catalyzed by DNA polymerases. (a) The basic reaction: a new DNA strand
is synthesized in the 5' to 3' direction. (b) DNA polymerase I, which initially fills in
nicks but then continues to synthesize a new strand, degrading the existing one as it
proceeds. (c) The Klenow fragment, which only fills in nicks. (d) Reverse transcriptase,
which uses a template of RNA.

The Taq DNA polymerase used in the polymerase chain reaction (PCR) (see Figure 1.2) 
is the DNA polymerase I enzyme of the bacterium Thermus aquaticus. This organism 
lives in hot springs, and many of its enzymes, including the Taq DNA polymerase, are 
thermostable, meaning that they are resistant to denaturation by heat treatment. This 
is the special feature of Taq DNA polymerase that makes it suitable for PCR, because if
it were not thermostable it would be inactivated when the temperature of the reaction is
raised to 94°C to denature the DNA.

The final type of DNA polymerase that is important in genetic engineering is reverse 
transcriptase, an enzyme involved in the replication of several kinds of virus. Reverse 
transcriptase is unique in that it uses as a template not DNA but RNA (Figure 4.5d). The 
ability of this enzyme to synthesize a DNA strand complementary to an RNA template 
is central to the technique called complementary DNA (cDNA) cloning (p. 133). 



4.1.4 DNA modifying enzymes

There are numerous enzymes that modify DNA molecules by addition or removal of
specific chemical groups. The most important are as follows:

• Alkaline phosphatase (from E. coli, calf intestinal tissue, or arctic shrimp), which
removes the phosphate group present at the 5' terminus of a DNA molecule
(Figure 4.6a).

• Polynucleotide kinase (from E. coli infected with T4 phage), which has the reverse
effect to alkaline phosphatase, adding phosphate groups onto free 5' termini
(Figure 4.6b).

• Terminal deoxynucleotidyl transferase (from calf thymus tissue), which adds one or
more deoxyribonucleotides onto the 3' terminus of a DNA molecule (Figure 4.6c).

Figure 4.6
The reactions catalyzed by DNA modifying enzymes. (a) Alkaline phosphatase, which
removes 5'-phosphate groups. (b) Polynucleotide kinase, which attaches 5'-phosphate
groups. (c) Terminal deoxynucleotidyl transferase, which attaches deoxyribonucleotides
to the 3' termini of polynucleotides in either (i) single-stranded or (ii) double-stranded
molecules.

4.2 Enzymes for cutting DNA — restriction 
endonucleases 

Gene cloning requires that DNA molecules be cut in a very precise and reproducible 
fashion. This is illustrated by the way in which the vector is cut during construction of 
a recombinant DNA molecule (Figure 4.7a). Each vector molecule must be cleaved at a 
single position, to open up the circle so that new DNA can be inserted: a molecule that 
is cut more than once will be broken into two or more separate fragments and will be 
of no use as a cloning vector. Furthermore, each vector molecule must be cut at exactly 
the same position on the circle — as will become apparent in later chapters, random 
cleavage is not satisfactory. It should be clear that a very special type of nuclease is 
needed to carry out this manipulation. 
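
The effect of the number of cuts on a circular vector can be illustrated with a short Python sketch. The vector length and the cut positions below are hypothetical numbers chosen purely for illustration, and the function name is invented for the example; it reports the sizes of the linear fragments produced when a circular molecule is cut at the given positions.

```python
def fragments_from_circle(length: int, cut_positions: list[int]) -> list[int]:
    """Return fragment lengths after cutting a circular molecule of the given length."""
    cuts = sorted({p % length for p in cut_positions})
    if not cuts:
        return []  # an uncut circle gives no linear fragments
    fragments = []
    for i, start in enumerate(cuts):
        end = cuts[(i + 1) % len(cuts)]
        # Distance to the next cut, going around the circle; a single cut
        # gives a distance of 0, which means the whole circle is linearized.
        fragments.append((end - start) % length or length)
    return fragments


VECTOR_LENGTH = 4000  # hypothetical vector size in base pairs

print(fragments_from_circle(VECTOR_LENGTH, [1000]))        # one cut
print(fragments_from_circle(VECTOR_LENGTH, [1000, 3500]))  # two cuts
```

A single cut returns one fragment equal in length to the whole vector, which is the opened circle needed for cloning. Two cuts return two separate fragments, neither of which is a complete vector.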

Often it is also necessary to cleave the DNA that is to be cloned (Figure 4.7b). There
are two reasons for this. First, if the aim is to clone a single gene, which may consist of
only 2 or 3 kb of DNA, then that gene will have to be cut out of the large (often greater
than 80 kb) DNA molecules produced by skilful use of the preparative techniques
described in Chapter 3. Second, large DNA molecules may have to be broken down
simply to produce fragments small enough to be carried by the vector. Most cloning
vectors exhibit a preference for DNA fragments that fall into a particular size range:
most plasmid-based vectors, for example, are very inefficient at cloning DNA molecules
more than 8 kb in length.

Figure 4.7
(a) Vector molecules
... a gene cloning experiment.

Purified restriction endonucleases allow the molecular biologist to cut DNA mole- 
cules in the precise, reproducible manner required for gene cloning. The discovery of 
these enzymes, which led to Nobel Prizes for W. Arber, H. Smith, and D. Nathans in 
1978, was one of the key breakthroughs in the development of genetic engineering. 

4.2.1 The discovery and function of restriction endonucleases

The initial observation that led to the eventual discovery of restriction endonucleases was 
made in the early 1950s, when it was shown that some strains of bacteria are immune 
to bacteriophage infection, a phenomenon referred to as host-controlled restriction. 

The mechanism of restriction is not very complicated, even though it took over 
20 years to be fully understood. Restriction occurs because the bacterium produces 
an enzyme that degrades the phage DNA before it has time to replicate and direct syn- 
thesis of new phage particles (Figure 4.8a). The bacterium’s own DNA, the destruction 
of which would of course be lethal, is protected from attack because it carries addi- 
tional methyl groups that block the degradative enzyme action (Figure 4.8b). 

These degradative enzymes are called restriction endonucleases and are synthesized
by many, perhaps all, species of bacteria: over 2500 different ones have been isolated
and more than 300 are available for use in the laboratory. Three different classes of
restriction endonuclease are recognized, each distinguished by a slightly different mode
of action. Types I and III are rather complex and have only a limited role in genetic
engineering. Type II restriction endonucleases, on the other hand, are the cutting
enzymes that are so important in gene cloning.

Figure 4.8
The function of a restriction endonuclease in a bacterial cell: (a) phage DNA is cleaved,
but (b) bacterial DNA is not.

4.2.2 Type II restriction endonucleases cut DNA at specific 
nucleotide sequences 

The central feature of type II restriction endonucleases (which will be referred to simply
as “restriction endonucleases” from now on) is that each enzyme has a specific recognition
sequence at which it cuts a DNA molecule. A particular enzyme cleaves DNA at the
recognition sequence and nowhere else. For example, the restriction endonuclease called
PvuI (isolated from Proteus vulgaris) cuts DNA only at the hexanucleotide CGATCG.
In contrast, a second enzyme from the same bacterium, called PvuII, cuts at a different
hexanucleotide, in this case CAGCTG.

Many restriction endonucleases recognize hexanucleotide target sites, but others cut 
at four, five, eight, or even longer nucleotide sequences. Sau3A (from Staphylococcus
aureus strain 3A) recognizes GATC, and AluI (Arthrobacter luteus) cuts at AGCT. There
are also examples of restriction endonucleases with degenerate recognition sequences,
meaning that they cut DNA at any one of a family of related sites. HinfI (Haemophilus
influenzae strain Rf), for instance, recognizes GANTC, so cuts at GAATC, GATTC,
GAGTC, and GACTC. The recognition sequences for some of the most frequently used 
restriction endonucleases are listed in Table 4.1. 
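The idea of a degenerate recognition sequence can be illustrated with a short Python sketch. It treats "N" as any of the four nucleotides, lists the concrete sequences covered by GANTC, and locates matching sites in a test sequence. The function names and the test sequence are illustrative assumptions, and only the top strand is searched, which is sufficient here because the sites discussed are palindromic.

```python
import re

# Only the four bases and "N" (any nucleotide) are handled in this sketch.
CODES = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}

def site_regex(recognition):
    """Translate a recognition sequence such as 'GANTC' into a regex pattern."""
    return "".join(CODES[base] for base in recognition)

def expand(recognition):
    """List every concrete sequence covered by a degenerate recognition sequence."""
    variants = [""]
    for base in recognition:
        choices = "ACGT" if base == "N" else base
        variants = [v + c for v in variants for c in choices]
    return variants

def find_sites(dna, recognition):
    """Return 0-based start positions of all matches (overlapping ones included)."""
    # A lookahead is used so that overlapping sites are not skipped.
    pattern = re.compile("(?=" + site_regex(recognition) + ")")
    return [m.start() for m in pattern.finditer(dna)]

if __name__ == "__main__":
    print(expand("GANTC"))
    dna = "TTGAATCGGGACTCAAGATC"  # hypothetical test sequence
    print(find_sites(dna, "GANTC"))
    print(find_sites(dna, "GATC"))
```

The expansion reproduces the four HinfI target sequences named in the text, in alphabetical order of the variable position.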






Table 4.1 

The recognition sequences for some of the most frequently used restriction endonucleases. 




*The sequence shown is that of one strand, given in the 5' to 3' direction. “N” indicates any nucleotide. Note that almost all recognition sequences are
palindromes: when both strands are considered they read the same in each direction, for example:

EcoRI    5'-GAATTC-3'
            ||||||
         3'-CTTAAG-5'

4.2.3 Blunt ends and sticky ends 

The exact nature of the cut produced by a restriction endonuclease is of considerable 
importance in the design of a gene cloning experiment. Many restriction endonucleases 
make a simple double-stranded cut in the middle of the recognition sequence (Figure 4.9a), 
resulting in a blunt end or flush end. PvuII and AluI are examples of blunt end cutters.

Other restriction endonucleases cut DNA in a slightly different way. With these 
enzymes the two DNA strands are not cut at exactly the same position. Instead the 
cleavage is staggered, usually by two or four nucleotides, so that the resulting DNA frag- 
ments have short single-stranded overhangs at each end (Figure 4.9b). These are called 
sticky or cohesive ends, as base pairing between them can stick the DNA molecule back 
together again (recall that sticky ends were encountered on p. 20 during the description
of λ phage replication). One important feature of sticky end enzymes is that restriction
endonucleases with different recognition sequences may produce the same sticky ends.
BamHI (recognition sequence GGATCC) and BglII (AGATCT) are examples: both
produce GATC sticky ends (Figure 4.9c). The same sticky end is also produced by
Sau3A, which recognizes only the tetranucleotide GATC. Fragments of DNA produced
by cleavage with any of these enzymes can be joined to each other, as each fragment
carries a complementary sticky end.
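The relationship between cut position and the type of end can be sketched in Python. The recognition sequences come from the text and the cut positions are those drawn in Figure 4.9. The table layout, the function names, and the assumption that each site is a palindrome cut symmetrically at or before its midpoint are simplifications made for this illustration.

```python
# name: (recognition sequence, bases to the left of the cut on the top strand)
# Cut positions follow Figure 4.9; the dictionary itself is illustrative.
ENZYMES = {
    "AluI": ("AGCT", 2),
    "EcoRI": ("GAATTC", 1),
    "BamHI": ("GGATCC", 1),
    "BglII": ("AGATCT", 1),
    "Sau3A": ("GATC", 0),
}

def overhang(name):
    """Return the single-stranded 5' overhang left by an enzyme.

    Assumes a palindromic site cut symmetrically at or before its midpoint,
    so the bottom strand is cut the same distance in from the other side.
    An empty string means the enzyme leaves a blunt end.
    """
    site, cut = ENZYMES[name]
    return site[cut:len(site) - cut]

def compatible(first, second):
    """True if both enzymes leave the same non-empty sticky end."""
    return overhang(first) != "" and overhang(first) == overhang(second)

if __name__ == "__main__":
    for enzyme in ENZYMES:
        print(enzyme, overhang(enzyme) or "blunt")
    print(compatible("BamHI", "BglII"))
    print(compatible("BamHI", "EcoRI"))
```

Under these assumptions BamHI, BglII and Sau3A all yield a GATC overhang, which is why their fragments can be joined to one another, whereas EcoRI ends (AATT) are not compatible with them.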



Figure 4.9

The ends produced by cleavage of DNA with different restriction endonucleases. (a) A blunt end produced by AluI.
(b) A sticky end produced by EcoRI. (c) The same sticky ends produced by BamHI, BglII and Sau3A.

(a) Production of blunt ends (‘N’ = A, G, C, or T)

-N-N-A-G-C-T-N-N-    AluI    -N-N-A-G  C-T-N-N-
-N-N-T-C-G-A-N-N-    --->    -N-N-T-C  G-A-N-N-

(b) Production of sticky ends

-N-N-G-A-A-T-T-C-N-N-    EcoRI    -N-N-G          A-A-T-T-C-N-N-
-N-N-C-T-T-A-A-G-N-N-    ---->    -N-N-C-T-T-A-A          G-N-N-

(c) The same sticky ends produced by different restriction endonucleases

BamHI    -N-N-G          G-A-T-C-C-N-N-
         -N-N-C-C-T-A-G          G-N-N-

BglII    -N-N-A          G-A-T-C-T-N-N-
         -N-N-T-C-T-A-G          A-N-N-

Sau3A    -N-N-N          G-A-T-C-N-N-N-
         -N-N-N-C-T-A-G          N-N-N-

4.2.4 The frequency of recognition sequences in a DNA
molecule

The number of recognition sequences for a particular restriction endonuclease in a DNA
molecule of known length can be calculated mathematically. A tetranucleotide sequence
(e.g., GATC) should occur once every 4^4 = 256 nucleotides, and a hexanucleotide
(e.g., GGATCC) once every 4^6 = 4096 nucleotides. These calculations assume that the
nucleotides are ordered in a random fashion and that the four different nucleotides are
present in equal proportions (i.e., the GC content = 50%). In practice, neither of these
assumptions is entirely valid. For example, the λ DNA molecule, at 49 kb, should
contain about 12 sites for a restriction endonuclease with a hexanucleotide recognition
sequence. In fact, many of these recognition sites occur less frequently (e.g., six
for BglII, five for BamHI, and only two for SalI), a reflection of the fact that the GC
content for λ is rather less than 50% (Figure 4.10a).
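The arithmetic behind these expectations can be written out as a short Python sketch. The first two functions reproduce the calculation in the text, under its assumptions of random order and equal base proportions. The third, which lets the GC content vary, is an added illustration rather than something taken from the text, and all function names are hypothetical.

```python
def expected_spacing(site_length):
    """Average distance between sites, assuming random sequence and 50% GC."""
    return 4 ** site_length

def expected_sites(dna_length, site_length):
    """Expected number of sites in a molecule of the given length (in nucleotides)."""
    return dna_length / expected_spacing(site_length)

def site_probability(site, gc_content=0.5):
    """Chance that a random position starts the site, for a given GC content.

    Illustrative extension: G and C each occur with frequency gc_content / 2,
    A and T each with (1 - gc_content) / 2; order is still assumed random.
    """
    p = 1.0
    for base in site:
        p *= gc_content / 2 if base in "GC" else (1 - gc_content) / 2
    return p

if __name__ == "__main__":
    print(expected_spacing(4))                 # tetranucleotide
    print(expected_spacing(6))                 # hexanucleotide
    print(round(expected_sites(49_000, 6)))    # hexanucleotide sites in 49 kb
    print(1 / site_probability("GGATCC", 0.45))
```

With equal base proportions the sketch gives 256 and 4096 nucleotides and about 12 sites in 49 kb, as in the text. Lowering the GC content makes a GC-rich site such as GGATCC rarer than this simple estimate predicts.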

Furthermore, restriction sites are generally not evenly spaced out along a DNA 
molecule. If they were, then digestion with a particular restriction endonuclease would 
give fragments of roughly equal sizes. Figure 4.10b shows the fragments produced by 
cutting λ DNA with BglII, BamHI, and SalI. In each case there is a considerable spread
of fragment sizes, indicating that in λ DNA the nucleotides are not randomly ordered.

The lesson to be learned from Figure 4.10 is that although mathematics may give an 
idea of how many restriction sites to expect in a given DNA molecule, only experi- 
mental analysis can provide the true picture. We must therefore move on to consider 
how restriction endonucleases are used in the laboratory. 



4.2.5 Performing a restriction digest in the laboratory 

As an example, we will consider how to digest a sample of λ DNA (concentration
125 μg/ml) with BglII.

First, the required amount of DNA must be pipetted into a test tube. The amount of
DNA that will be restricted depends on the nature of the experiment. In this case we will
digest 2 μg of λ DNA, which is contained in 16 μl of the sample (Figure 4.11a). Very
accurate micropipettes will therefore be needed.
Figure 4.10

Restriction of the λ DNA molecule. (a) The positions of the recognition sequences for BglII, BamHI, and SalI. (b) The
fragments produced by cleavage with each of these restriction endonucleases. The numbers are the fragment sizes in
base pairs.

(a) Cleavage sites on λ DNA (scale 0 to 49 kb): BglII, 6 sites; BamHI, 5 sites; SalI, 2 sites.

(b) Fragment sizes

BglII    22 010, 13 286, 9688, 2392, 651, 415, 60
BamHI    16 841, 7233, 6770, 6527, 5626, 5505
SalI     32 745, 15 258, 499

Figure 4.11

Performing a restriction digest in the laboratory.



The other main component in the reaction will be the restriction endonuclease,
obtained from a commercial supplier as a pure solution of known concentration. But
before adding the enzyme, the solution containing the DNA must be adjusted to provide
the correct conditions to ensure maximal activity of the enzyme. Most restriction
endonucleases function adequately at pH 7.4, but different enzymes vary in their
requirements for ionic strength (usually provided by sodium chloride (NaCl)) and
magnesium (Mg²⁺) concentration (all type II restriction endonucleases require Mg²⁺ in
order to function). It is also advisable to add a reducing agent, such as dithiothreitol
(DTT), which stabilizes the enzyme and prevents its inactivation. Providing the right
conditions for the enzyme is very important: incorrect NaCl or Mg²⁺ concentrations
not only decrease the activity of the restriction endonuclease, they might also cause
changes in the specificity of the enzyme, so that DNA cleavage occurs at additional,
non-standard recognition sequences.

Table 4.2

A 10× buffer suitable for restriction of DNA with BglII.

The composition of a suitable buffer for BglII is shown in Table 4.2. This buffer is ten
times the working concentration, and is diluted by being added to the reaction mixture.
In our example, a suitable final volume for the reaction mixture would be 20 μl, so we
add 2 μl of 10× BglII buffer to the 16 μl of DNA already present (Figure 4.11b).

The restriction endonuclease can now be added. By convention, 1 unit of enzyme
is defined as the quantity needed to cut 1 μg of DNA in 1 hour, so we need 2 units
of BglII to cut 2 μg of λ DNA. BglII is frequently obtained at a concentration of
4 units/μl, so 0.5 μl is sufficient to cleave the DNA. The final ingredients in the
reaction mixture are therefore 0.5 μl BglII + 1.5 μl water, giving a final volume of 20 μl
(Figure 4.11c).
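The volumes in this worked example follow from simple arithmetic, set out here as a Python sketch. The function name and its parameters are illustrative assumptions. The unit definition (1 unit cuts 1 μg of DNA in 1 hour) and the 10× buffer dilution are those given in the text.

```python
def digest_recipe(dna_ug, dna_conc_ug_per_ml, enzyme_units_per_ul,
                  final_volume_ul=20.0):
    """Work out the volumes (in microlitres) for a 1-hour restriction digest."""
    # 1 ml = 1000 microlitres, so volume = mass * 1000 / (concentration per ml)
    dna_ul = dna_ug * 1000.0 / dna_conc_ug_per_ml
    # A 10x buffer is diluted to working strength at one tenth of the final volume
    buffer_ul = final_volume_ul / 10.0
    # 1 unit cuts 1 microgram of DNA in 1 hour
    units_needed = dna_ug
    enzyme_ul = units_needed / enzyme_units_per_ul
    water_ul = final_volume_ul - dna_ul - buffer_ul - enzyme_ul
    if water_ul < 0:
        raise ValueError("components exceed the final volume")
    return {
        "DNA": dna_ul,
        "10x buffer": buffer_ul,
        "enzyme": enzyme_ul,
        "water": water_ul,
    }

if __name__ == "__main__":
    # 2 micrograms of DNA at 125 micrograms/ml, enzyme at 4 units per microlitre
    for component, volume in digest_recipe(2.0, 125.0, 4.0).items():
        print(f"{component}: {volume:.1f} microlitres")
```

For the values used in the text this gives 16 μl of DNA, 2 μl of buffer, 0.5 μl of enzyme and 1.5 μl of water.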

The last factor to consider is incubation temperature. Most restriction endonucleases,
including BglII, work best at 37°C, but a few have different requirements.
TaqI, for example, is a restriction enzyme from Thermus aquaticus and, like Taq DNA
polymerase, has a high working temperature. Restriction digests with TaqI must be
incubated at 65°C to obtain maximum enzyme activity.

After 1 hour the restriction should be complete (Figure 4.11d). If the DNA fragments
produced by restriction are to be used in cloning experiments, the enzyme must somehow
be destroyed so that it does not accidentally digest other DNA molecules that may
be added at a later stage. There are several ways of “killing” the enzyme. For many, a
short incubation at 70°C is sufficient; for others, phenol extraction or the addition of
ethylenediamine tetraacetate (EDTA), which binds Mg²⁺ ions and so prevents restriction
endonuclease action, is used (Figure 4.11e).

4.2.6 Analyzing the result of restriction endonuclease 
cleavage 

A restriction digest results in a number of DNA fragments, the sizes of which depend 
on the exact positions of the recognition sequences for the endonuclease in the original 
molecule (see Figure 4.10). A way of determining the number and sizes of the fragments 
is needed if restriction endonucleases are to be of use in gene cloning. Whether or not a 
DNA molecule is cut at all can be determined fairly easily by testing the viscosity of the 
solution. Larger DNA molecules result in a more viscous solution than smaller ones, so 
cleavage is associated with a decrease in viscosity. However, working out the number and 
sizes of the individual cleavage products is more difficult. In fact, for several years this was 
one of the most tedious aspects of experiments involving DNA. Eventually the problems 
were solved in the early 1970s when the technique of gel electrophoresis was developed. 
Figure 4.12

(a) Standard electrophoresis does not separate DNA fragments of different sizes, whereas
(b) gel electrophoresis does. In (b) the DNA is loaded into a well cut out of the gel and,
after electrophoresis, separates into bands of different-sized fragments.



Separation of molecules by gel electrophoresis 

Electrophoresis, like ion-exchange chromatography (see p. 30), is a technique that uses 
differences in electrical charge to separate the molecules in a mixture. DNA molecules 
have negative charges, and so when placed in an electric field they migrate toward 
the positive pole (Figure 4.12a). The rate of migration of a molecule depends on two 
factors, its shape and its charge-to-mass ratio. Unfortunately, most DNA molecules are 
the same shape and all have very similar charge-to-mass ratios. Fragments of different 
sizes cannot therefore be separated by standard electrophoresis. 

The size of the DNA molecule does, however, become a factor if the electro- 
phoresis is performed in a gel. A gel, which is usually made of agarose, polyacrylamide, 
or a mixture of the two, comprises a complex network of pores, through which the 
DNA molecules must travel to reach the positive electrode. The smaller the DNA mole- 
cule, the faster it can migrate through the gel. Gel electrophoresis therefore separates 
DNA molecules according to their size (Figure 4.12b). 

In practice the composition of the gel determines the sizes of the DNA molecules 
that can be separated. A 0.5 cm thick slab of 0.5% agarose, which has relatively large 
pores, would be used for molecules in the size range 1-30 kb, allowing, for example, 
molecules of 10 and 12 kb to be clearly distinguished. At the other end of the scale, a 
very thin (0.3 mm) 40% polyacrylamide gel, with extremely small pores, would be used 
to separate much smaller DNA molecules, in the range 1-300 bp, and could distinguish 
molecules differing in length by just a single nucleotide. 
Figure 4.13

Visualizing DNA bands in an agarose gel by EtBr staining and ultraviolet (UV) irradiation.



Visualizing DNA molecules in an agarose gel 

The easiest way to see the results of a gel electrophoresis experiment is to stain the gel 
with a compound that makes the DNA visible. Ethidium bromide (EtBr), already 
described on p. 37 as a means of visualizing DNA in caesium chloride gradients, is also 
routinely used to stain DNA in agarose and polyacrylamide gels (Figure 4.13). Bands 
showing the positions of the different size classes of DNA fragment are clearly visible 
under ultraviolet irradiation after EtBr staining, so long as sufficient DNA is present. 
Unfortunately, the procedure is very hazardous because ethidium bromide is a power- 
ful mutagen. EtBr staining also has limited sensitivity, and if a band contains less than 
about 10 ng of DNA then it might not be visible after staining. 

For this reason, non-mutagenic dyes that stain DNA green, red, or blue are now 
used in many laboratories. Most of these dyes can be used either as a post-stain after 
electrophoresis, as illustrated in Figure 4.13 for EtBr, or alternatively, because they 
are non-hazardous, they can be included in the buffer solution in which the agarose 
or polyacrylamide is dissolved when the gel is prepared. Some of these dyes require 
ultraviolet irradiation in order to make the bands visible, but others are visualized by 
illumination at other wavelengths, for example under blue light, removing a second 
hazard as ultraviolet radiation can cause severe burns. The most sensitive dyes are able 
to detect bands that contain less than 1 ng DNA. 

4.2.7 Estimation of the sizes of DNA molecules

Gel electrophoresis separates different sized DNA molecules, with the smallest molecules 
traveling the greatest distance toward the positive electrode. If several DNA fragments of 
varying sizes are present (the result of a successful restriction digest, for example), then a 
series of bands appears in the gel. How can the sizes of these fragments be determined? 

The most accurate method is to make use of the mathematical relationship that links 
migration rate to molecular mass. The relevant formula is: 

$D = a - b(\log M)$

where D is the distance moved, M is the molecular mass, and a and b are constants that
depend on the electrophoresis conditions.

Figure 4.14

Estimation of the sizes of DNA fragments in an agarose gel.
(a) A rough estimate of fragment size can be obtained by eye; in the example shown, the
unknown fragments run alongside HindIII-λ markers are estimated at c. 5000 bp, c. 3200 bp
and c. 2000 bp.
(b) A more accurate measurement of fragment size is gained by using the mobilities of the
HindIII-λ fragments to construct a calibration curve; the sizes of the unknown fragments can
then be determined from the distances they have migrated.

Because extreme accuracy in estimating DNA fragment sizes is not always necessary, 
a much simpler though less precise method is more generally used. A standard restric- 
tion digest, comprising fragments of known size, is usually included in each electro- 
phoresis gel that is run. Restriction digests of λ DNA are often used in this way as size
markers. For example, HindIII cleaves λ DNA into eight fragments, ranging in size from
125 bp for the smallest to over 23 kb for the largest. As the sizes of the fragments in this 
digest are known, the fragment sizes in the experimental digest can be estimated by 
comparing the positions of the bands in the two tracks (Figure 4.14). Special mixtures 
of DNA fragments called DNA ladders, whose sizes are multiples of 100 bp or of 1 kb, 
can also be used as size markers. Although not precise, size estimation by comparison 
with DNA markers can be performed with as little as a 5% error, which is satisfactory 
for most purposes. 
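The calibration approach can be sketched in Python by fitting the relationship D = a - b(log M) to marker bands and then inverting it for an unknown band. Several assumptions are made here: fragment length in base pairs stands in for molecular mass (the two are proportional for double-stranded DNA), the logarithm is taken to base 10, and the marker distances are synthetic values generated from arbitrary constants rather than measurements. The fragment sizes are the standard published values for a HindIII digest of λ DNA. In a real gel the straight-line relationship holds only over a limited size range.

```python
import math

def fit_calibration(sizes_bp, distances_mm):
    """Least-squares fit of D = a - b*log10(M); returns the constants (a, b)."""
    xs = [math.log10(m) for m in sizes_bp]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_d = sum(distances_mm) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxd = sum((x - mean_x) * (d - mean_d) for x, d in zip(xs, distances_mm))
    slope = sxd / sxx                 # the fitted slope equals -b
    a = mean_d - slope * mean_x
    return a, -slope

def estimate_size(distance_mm, a, b):
    """Invert D = a - b*log10(M) to estimate the size of an unknown fragment."""
    return 10 ** ((a - distance_mm) / b)

# Sizes (bp) of the eight HindIII fragments of lambda DNA
MARKER_SIZES = [23130, 9416, 6557, 4361, 2322, 2027, 564, 125]

if __name__ == "__main__":
    # Synthetic distances from arbitrary constants a = 100, b = 20 (not real data)
    marker_distances = [100 - 20 * math.log10(m) for m in MARKER_SIZES]
    a, b = fit_calibration(MARKER_SIZES, marker_distances)
    unknown_distance = 100 - 20 * math.log10(3200)
    print(round(estimate_size(unknown_distance, a, b)))
```

Because only the slope and intercept are fitted, the choice of logarithm base changes the value of b but not the estimated sizes.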

4.2.8 Mapping the positions of different restriction sites in a
DNA molecule 

So far we have considered how to determine the number and sizes of the DNA fragments 
produced by restriction endonuclease cleavage. The next step in restriction analysis 
is to construct a map showing the relative positions in the DNA molecule of the 
recognition sequences for a number of different enzymes. Only when a restriction map 
is available can the correct restriction endonucleases be selected for the particular 
cutting manipulation that is required (Figure 4.15). 
Figure 4.15

Using a restriction map to work out which restriction endonucleases should be used to obtain
DNA fragments containing individual genes. The figure aligns a genetic map (genes A, B, C
and D) with a restriction map showing BglII, BamHI and SalI sites: to obtain gene B, digest
with BglII; to obtain gene D, digest with BamHI + SalI.





To construct a restriction map, a series of restriction digests must be performed. First, 
the number and sizes of the fragments produced by each restriction endonuclease 
must be determined by gel electrophoresis followed by comparison with size markers 
(Figure 4.16). This information must then be supplemented by a series of double 
digestions, in which the DNA is cut by two restriction endonucleases at once. It might 
be possible to perform a double digestion in one step if both enzymes have similar 
requirements for pH, Mg²⁺ concentration, etc. Alternatively, the two digestions may
have to be carried out one after the other, adjusting the reaction mixture after the first 
digestion to provide a different set of conditions for the second enzyme. 

Comparing the results of single and double digests will allow many, if not all, of the 
restriction sites to be mapped (Figure 4.16). Ambiguities can usually be resolved by 
partial digestion, carried out under conditions that result in cleavage of only a limited 
number of the restriction sites on any DNA molecule. Partial digestion is usually 
achieved by reducing the incubation period, so the enzyme does not have time to cut all 
the restriction sites, or by incubating at a low temperature (e.g., 4°C rather than 37°C), 
which limits the activity of the enzyme. 

The result of a partial digestion is a complex pattern of bands in an electrophoresis 
gel. As well as the standard fragments, produced by total digestion, additional sizes are 
seen. These are molecules that comprise two adjacent restriction fragments, separated 
by a site that has not been cleaved. Their sizes indicate which restriction fragments in 
the complete digest are next to one another in the uncut molecule (Figure 4.16). 
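The logic of restriction mapping can be checked with a short Python sketch: given proposed site positions on a linear molecule, it computes the fragments expected from single, double and partial digests, which can then be compared with the gel results. The site positions below are those implied by the worked example in Figure 4.16 (a 48.5 kb λ molecule). The function names are illustrative, and the orientation of the map is arbitrary, since its mirror image would give identical fragments.

```python
from itertools import combinations

def digest(length, sites):
    """Fragment sizes from cutting a linear molecule at every listed position."""
    cuts = [0.0] + sorted(sites) + [length]
    return sorted(right - left for left, right in zip(cuts, cuts[1:]))

def partial_digest(length, sites):
    """Every fragment size that can appear when any subset of sites is cut."""
    sizes = set()
    for k in range(len(sites) + 1):
        for subset in combinations(sites, k):
            sizes.update(digest(length, list(subset)))
    return sorted(sizes)

# Proposed map (kb from one end), as deduced in Figure 4.16
LAMBDA_KB = 48.5
XHO_SITES = [15.0]
XBA_SITES = [24.0]
KPN_SITES = [30.0, 31.5]

if __name__ == "__main__":
    print("XbaI:", digest(LAMBDA_KB, XBA_SITES))
    print("XhoI:", digest(LAMBDA_KB, XHO_SITES))
    print("KpnI:", digest(LAMBDA_KB, KPN_SITES))
    print("XbaI + XhoI:", digest(LAMBDA_KB, XBA_SITES + XHO_SITES))
    print("XbaI + KpnI:", digest(LAMBDA_KB, XBA_SITES + KPN_SITES))
    print("KpnI partial:", partial_digest(LAMBDA_KB, KPN_SITES))
```

A proposed map is consistent with the data only if every predicted fragment list matches the observed one. Here the partial KpnI digest predicts the 18.5 and 31.5 kb bands that place the 1.5 kb fragment between the 30.0 and 17.0 kb fragments.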

Figure 4.16

Restriction mapping. This example shows how the positions of the XbaI, XhoI and KpnI sites on the λ DNA molecule can
be determined.

Single and double digestions

Enzyme         Number of fragments    Sizes (kb)
XbaI           2                      24.0, 24.5
XhoI           2                      15.0, 33.5
KpnI           3                      1.5, 17.0, 30.0
XbaI + XhoI    3                      9.0, 15.0, 24.5
XbaI + KpnI    4                      1.5, 6.0, 17.0, 24.0

Conclusions:

(1) As λ DNA is linear, the number of restriction sites for each enzyme is
XbaI 1, XhoI 1, KpnI 2.

(2) The XbaI and XhoI sites can be mapped. The XbaI fragments are 24.0 and 24.5 kb, the
XbaI-XhoI fragments are 9.0, 15.0 and 24.5 kb, and the XhoI fragments are 15.0 and 33.5 kb.
The only possibility is:

        XhoI     XbaI
|--------|--------|------------------|
   15.0     9.0          24.5

(3) All the KpnI sites fall in the 24.5 kb XbaI fragment, as the 24.0 kb fragment
is intact after XbaI-KpnI double digestion. The order of the KpnI fragments
can be determined only by partial digestion.

Partial digestion

Enzyme                       Fragment sizes (kb)
KpnI, limiting conditions    1.5, 17.0, 18.5, 30.0, 31.5, 48.5

Conclusions:

(1) 48.5 kb fragment = uncut λ.

(2) 1.5, 17.0 and 30.0 kb fragments are products of complete digestion.

(3) 18.5 and 31.5 kb fragments are products of partial digestion.

The KpnI map must be:

                    KpnI KpnI
|--------------------|---|-----------|
        30.0          1.5    17.0

Therefore the complete map is:

        XhoI     XbaI    KpnI KpnI
|--------|--------|-------|---|-----------|
   15.0     9.0      6.0   1.5    17.0

4.2.9 Special gel electrophoresis methods for separating
larger molecules

During agarose gel electrophoresis, a DNA fragment migrates at a rate that depends
on its size, but this relationship is not a direct one. The formula that links
migration rate to molecular mass has a logarithmic component (p. 59), which means
that the differences in migration rates become increasingly small for larger molecules
(Figure 4.17). In practice, molecules larger than about 50 kb cannot be resolved
efficiently by standard gel electrophoresis.

This size limitation is not usually a problem when the restriction fragments being 
studied have been obtained by cutting the DNA with an enzyme with a tetranucleotide 
or hexanucleotide recognition sequence. Most, if not all, of the fragments produced 
in this way will be less than 30 kb in length and easily resolved by agarose gel electro- 
phoresis. Difficulties might arise, however, if an enzyme with a longer recognition
sequence is used, such as NotI, which cuts at an eight-nucleotide sequence (see
Table 4.1). NotI would be expected, on average, to cut a DNA molecule once every



